(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization fa 
International Bureau 



(43) International Publication Date 
4 May 2006 (04.05.2006) 



PCT 



1 1 iiiii iieiiiii h mm urn urn uni mi i rr m urn iiifi mu mir run rm inrifi mi nn mi 

(10) International Publication Number 

WO 2006/047589 A2 



(51) International Patent Classification: 

C12N 9/90 (2006.01) C12N 15/10 (2006.01) 



si Filing Date: 25 October 2005 (25.10.2005) 

(25) Filing Language: English 

(26) Publication Language: English 

(30) Priority Data: 

60/622,206 25 October 2004 (25.10.2004) US 

(71) Applicant (for all designated States except US): 
CODEXIS, INC. [US/US]; 515 Galveston Drive, 
5 Redwood City, CA 940G3 (US). 

| (72) Inventors; and 

j (75) Inventors/Applicants (for US only): CHATTERJEE, 

\ Ran,iini [SG/US]; 2118 Arthur Avenue, Belmont, CA 

i 94002 (US). MITCHELL, Kenneth, W. [US/US]; 559 

\ Grand Fir Avenue, Unit 2, Sunnyvale, CA 94086 (US). 

| LOUIE, Susan, Y. LUS/USJ; 928 Visitacion Avenue, San 

i Francisco, CA 94134 (US). FOX, Richard, J. LUS/USJ; 

S 21 Ilomewood Drive, Kirkwood, MO 63 122 (US). CHEN, 

= Michelle [CN/US]; 2151 Carlmont Drive, Apt. 402, Bel- 

: mont, CA 94002 (US). 



(74) Agent: POCHOPDZN, Donald, .1.; McAndrews, Held & 
Malloy, Ltd.. 500 W. Madison Street, 34th Floor, Chicago, 
IL 60661 (US). 

(81) Designated States (unless otherwise indicated, for every 
kind of national protection available): AE, AG, AL, AM,- 
AT, AU, AZ, BA, BB, BG, BR, BW, BY, BZ, CA, CH, CN, 
CO, CR, CU, CZ, DE, DK, DM, DZ, EC, EE, EG, ES, FI, 
GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KB, 
KG, KM, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, LY, 
MA, MD, MG, MK, MN, MW, MX, MZ, NA, NG, M, NO, 
NZ, OM, PG, PH, PL, PT, RO, RU, SC, SD, SE, SG, SK, 
SL, SM, SY, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, 
VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (unless otherwise indicated, for every 
kind of regional protection available): ARIPO (BW, GH, 
GM, KE, LS, MW, MZ, NA, SD, SL, SZ, TZ, UG, ZM, 
ZW), Eurasian (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European (AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, 
FR, GB, GR, HU, IE, IS, IT, LT, LU, LV, MC, NL, PL, PT, 
RO, SE, SI, SK, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, 
GN, GQ, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

tor two-letter codes and other abbreviations, refer to tlte "Guid- 
ance Notes on Codes and Abbreviations " appearing at tlie begin- 
ning of each regular issue of the PCT Gazette. 



: (54) Title: IMPROVED ALANINE 2,3-AMINOMUTASES AND RELATED POLYNUCLEOTIDES 



H 2 N" 



CH 2 



Alanine 2,3-aminomutase 



CH 3 



H 2 N- 



a-aianine p-alanine 

00 

J£) (57) Abstract: The present invention is directed to polypeptides that have enhanced alanine 2,3-aminomutase (AAM) activity and/or 
C thermostability relative to the wild-type enzymes that have incidental AAM activity as a result of cross reactivity with alanine. In 
^ addition, the present invention is directed to a polynucleotides that encodes for the AAM polypeptides of the present invention, to 
nucleic acid sequences comprising the polynucleotides, to expression vectors comprising the polynucleotides opcrativcly linked to 
^S, a promoter, to host cells transformed to express the AAM polypeptides, and to a method for producing the AAM polypeptides of the 
^ present invention. 



o 



EXHIBIT B: PAGE 1 OF 1 



WO 2006/047589 



PCT/US2005/038552 



Attorney Docket No.0359.2 1 0WO/1 5686WO02 

IMPROVED ALANINE 2 ; 3-AMINOMUTASES 
AND RELATED POLYNUCLEOTIDES 

FIELD OF THE INVENTION 
[01] The present invention is related to the field of enzymology, and particularly to 
the field of alanine 2,3-aminomutase (AAM) enzymology. More specifically, the 
present invention is directed to alanine 2,3-aminomutase polypeptides having 
improved enzymatic activity (i.e., high substrate turnover) and stability, and to 
polynucleotides sequences encoding for the improved alanine 2,3-aminomutase 
polypeptides. The present invention is useful because the alanine 2,3-aminomutase 
polypeptides can be coupled to other enzymes to produce synthetic organic chemicals, 
such as pantothenic acid or 3-hydroxypropionic acid in high yields. 

BACKGROUND OF THE INVENTION 
[021 Organic chemicals such as organic acids, esters, and polyols can be used to 
synthesize plastic materials and other products. To meet the increasing demand for 
organic chemicals, more efficient and cost-effective production methods are being 
developed which utilize raw materials based on carbohydrates rather than 
hydrocarbons. For example, certain bacteria have been used to produce large 
quantities of lactic acid used in the production of polylactic acid. 
[03] 3-hydroxypropionic acid (3-HP) is an organic acid. Several chemical synthesis 
routes have been described to produce 3-HP, and biocatalytic routes have also been 
disclosed (WO 01/16346 to Suthers et al.). 3-HP has utility for specialty synthesis 
and can be converted to commercially important intermediates by known methods in 
the chemical industry, e.g., acrylic acid by dehydration, malonic acid by oxidation, 
esters by esterification reactions with alcohols, and 1,3-propanediol by reduction. 
[04] The compound 3-HP can be produced biocatalytically from PEP or pyruvate, 
through a key beta-alanine intermediate (FIG. 1). Beta-alanine can be synthesized in 
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cells from carnosine, beta-alanyl arginine, beta-alanyl lysine, uracil via 5,6- 
dihydrouracil and N-carbamoyl-beta-alanine, N-acetyl-beta-alanine, anserine, or 
aspartate. However, these routes are commercially unviable because they require rare 
precursors or starting compounds that are more valuable than 3-HP. 
Therefore, production of 3-HP using biocatalytic routes would be more efficient if 
alpha-alanine could be converted to beta-alanine directly (FIG. 1). Unfortunately, a 
naturally occurring enzyme that inter-converts alpha-alanine to beta-alanine has not 
yet been identified. It would be advantageous if enzymatic activities that carry out the 
conversion of alpha-alanine to beta-alanine were identified, such as an alanine 2,3- 
aminomutase. Accordingly, it is one object of the present invention to identify 
enzymes with improved alanine 2-3-aminomutase activity. 

[05] Lysine 2,3-aminomutase (KAM), which catalyzes the anaerobic 
interconversion of lysine to beta-lysine, was first described by Barker in Clostridium 
SB4 (now C. subterminale) catalyzing the first step in the fermentation of lysine. 
KAM has been purified from C. subterminale, the gene cloned and expressed in E. 
coli. See e.g., U.S. Pat. 6,248,874, which issued on June 19, 2001 to Frey et al., the 
whole of which is hereby incorporated herein by reference. The specific activity of 
purified KAM from C. subterminale SB4 cells has been reported as 30-40 units/mg 
(Lieder et. al., Biochemistry 37:2578 (1998)), where a unit is defined as pinoles 
lysine/min. The corresponding purified recombinantly produced KAM had equivalent 
enzyme activity (34.5 ± 1.6 umoles lysine/min/mg protein). See U.S. Patent 
Application Publication No. 2003/0113882 Al, which published on June 19, 2003 to 
Frey et al, the whole of which is incorporated herein by reference. 
[06] Based upon the sequence of the KAM from C. subterminale, KAM genes have 
been annotated in the genomes of other organisms. However, in most cases, the 
enzymatic activities of the polypeptides encoded by these genes have not been 
confirmed. Exceptions are the B. subtilis gene (Chen, D., Ruzicka, F.J., and Frey, 
P.A. (2000) Biochem. J. 348:539-549)), and the Porphyromonas gingivalis and F. 
nucleatum genes. The B. subtilis KAM, encoded by the yodO gene, is more resistant 
to O2 than the C. subterminale KAM, but it is markedly less active. As reported by 
Frey, the B. subtilis KAM has a specific activity of only 0.62 U/mg. 
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[07] C. subtenninale SB4 KAM has been reported to have some cross-reactivity 
with L-alanine, converting it into beta-alanine. See U.S. Patent Application 
Publication No. 2003/0113882 Al. WO 03/062173 and WO 02/42418 disclose the 
first reports of AAM activity based upon modification of kam genes. In these 
applications, the synthetic aam genes had AAM activity as detected by the 
complementation of a ApanD E. coli strain. However, because alanine is not the 
natural substrate for this enzyme, the activity for this conversion is substantially less 
than the activity for conversion of lysine — its natural substrate. The AAM activity 
of a variant of B. subtilis KAM that also had AAM activity at approximately 0.001 
U/mg. It is an object of the present invention to provide polynucleotides encoding a 
polypeptide having substantially enhanced AAM activity over that found in the wild- 
type enzymes. 
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SUMMARY OF THE INVENTION 
[08] The present invention has multiple aspects. In one aspect, the present 
invention is directed to polypeptides that catalyze the reaction of FIG. 1. In one 
embodiment of this first aspect, the present invention is directed to a polypeptide 
having alanine 2,3-aminomutase (AAM) activity, preferably as measured by the assay 
of Example 8, and, 

(a) having a polypeptide selected from the group consisting of SEQ ID NO: 2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48 and 51; 

(b) having an amino acid sequence which has at least 98% homology, with the amino 
acid sequence selected from the group consisting of SEQ ID NO: 2, 22, 28, 32, and 
36; 

(c) having an amino acid sequence which has at least 99% homology, with the amino 
acid sequence selected from the group consisting of SEQ ED NO: 4, 6, 8, 12, 16, 24, 
26, 30, 34 and 40; 

(d) being a polypeptide encoded by a nucleic acid sequence which hybridizes under 
high stringency conditions with either (i) the nucleotide sequence of SEQ ID NO: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 41, 43, 45, 47 or 49; 
(ii) a subsequence of (i) of at least 100 nucleotides, or (iii) a complementary strand of 
(i) or (ii) (J. Sambrook, E. F. Fritsch, and T. Maniatis, 1989, Molecular Cloning, A 
Laboratory Manual, 2d edition, Cold Spring Harbor, N. Y.); or 

(e) being a variant of the polypeptide of (c) comprising a substitution, deletion, and/or 
insertion of one to six amino acids therefrom and having AAM activity from about 1 
to about 30 pM P-alanine produced /hour 1 cell OD atpH 7.0-7.6, 25°C. 

[09] Collectively, the polypeptides of (b) and (c) above are referred to herein as 
"homologous polypeptides." For purposes of the present invention, the degree of 
homology between two amino acid sequences is expressed as "percent homology," 
"percent identity," "% identity," "percent identical," and "% identical" are used 
interchangeably herein to refer to the percent amino acid sequence identity that is 
obtained by ClustalW analysis (version W 1.8 available from European 
Bioinformatics Institute, Cambridge, UK), counting the number of identical matches 
in the alignment and dividing such number of identical matches by the length of the 
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reference sequence, and using the following default ClustalW parameters to achieve 
slow/accurate pairwise optimal alignments - Gap Open Penalty: 10; Gap Extension 
Penalty-.O.IO; Protein weight matrix: Gonnet series; DNA weight matrix: IUB; 
Toggle Slow/Fast pairwise alignments = SLOW or FULL Alignment. 
[10] In one embodiment, the present invention is also directed to an AAM 
polypeptide as described herein in isolated and purified form. 

[11] In another embodiment, the present invention is directed to an AAM 
polypeptide as described herein in lyophilized form. 

[12] In yet another embodiment, the present invention is directed to a composition 
comprising an AAM polypeptide as described herein and a suitable carrier, typically a 
buffer solution, more typically an aqueous buffer solution having a pH between 6.0 
and 8.0. The composition may also be in a lyophilized form. 

[13] The novel AAM polypeptides of the present invention have significantly 
enhanced AAM activity relative to the wild-type KAJVI polypeptides from which they 
are ultimately derived. By significantly enhanced AAM activity is meant that the 
AAM polypeptide of the present invention has an AAM activity within the range of 
about 1 to about 32 uM p-alanine produced/hour 1 cell OD (units), preferably from 
about 10 to about 32 units, more preferably from about 20 to about 32 units; most 
preferably from about 25 to about 32 units. 

[14] Preferred AAM polypeptides of the present invention have an amino acid 

sequences of SEQ ID NOs: 2, 6, 12, 16, 20, 24, 28, 30, 32, 34, 38, 44, 46 or 48; more 

preferably they have an amino acid sequence of SEQ ID NOs: 6, 12, 28, 34, 46 or 48; 

most preferably, they have an amino acid sequence of SEQ ID NOs: 28 or 34. 

[15] One of the grandparent molecules is the KAM of Bacillus subtilis, which had 

no detectible AAM activity. The DNA encoding this grandparent molecule was 

modified as described in WO 03/062173, entitled "Alanine 2,3-aminomutase," to 

produce a polypeptide having a detectible alanine 2,3-aminomutase activity. 

[16] In the present application, the applicants utilized as one parent molecule a 

polynucleotide sequence of SEQ ID NO: 58, which encoded the 471 residue 

polypeptide of SEQ ID NO: 59 and which exhibited an AAM activity of 
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approximately .001 U/mg (units/ mg of cell mass). The molecule of SEQ ID NO: 59 
differs from the wild-type B. subtilis KAM, which had no detectible AAM activity, by 
having the following four (4) amino acid substitutions: L103M, M136V, Y140H and 
D339H. 

[17] In yet another embodiment, the present invention is directed to a polypeptide 
having from about 1 to about 32 units of AAM activity and typically varying from the 
polypeptide of SEQ ED NO: 59 by 1-7 amino acid residues, more typically by 1-6 
amino acid residues, even more typically by 1-5 amino acid residues, and most 
typically by 1-4 amino acid residues. 

[18] In its second aspect, the present invention is directed to a polynucleotide 
sequence that encodes for the correspondingly referenced AAM polypeptide. Given 
the degeneracy of the genetic code, the present invention is also directed to any 
polynucleotide that encodes for the above referenced AAM polypeptides of the 
present invention. In another preferred embodiment, the present invention is directed 
to certain specific polynucleotides of SEQ ED NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 
21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47 and 49 that encode for the novel 
AAM polypeptides of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44, 46, 48 and 51, respectively. Preferred polynucleotides 
encode for a polypeptide of SEQ ED NO: 2, 6, 12, 16, 20, 24, 28, 30, 32, 34, 38, 44, 
46 or 48; more preferably they encode a polypeptide of SEQ ID NO: 6, 12, 28, 34, 46 
or 48; most preferably, they have a polypeptide of sequence of SEQ ID NO: 28 or 34. 
[19] In a third aspect, the present invention is directed to a nucleic acid construct, a 
vector, or a host cell comprising a polynucleotide sequence encoding an AAM 
polypeptide of the present invention operatively linked to a promoter. 
[20] In a fourth aspect, the present invention is directed to a method of making an 
AAM polypeptide of the present invention comprising (a) cultivating a host cell 
transformed with a nucleic acid sequence encoding an AAM polypeptide of the 
present invention under conditions suitable for production of the polypeptide; and (b) 
providing glucose to the cultivated host cells under conditions suitable for the 
production of p-alanine. The p-alanine may be optionally recovered from the cells. 
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[21] In a fifth aspect, the present invention is directed to a method of producing b- 
alanine comprising (a) cultivating a host cell transformed with a nucleic acid sequence 
encoding an AAM polypeptide of the present invention under conditions suitable for 
production of the polypeptide; and (b) providing glucose to the cultivated host cells 
under conditions suitable for the production of b-alanine. The b-alanine may be 
optionally recovered from the cells. 
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BRIEF DESCRIPTION OF SEVERAL VIEWS OF THE DRAWINGS 
[22] FIG. 1 shows the reversible reaction between alpha-alanine [i.e., L-alanine or 
2-aminopropionic acid) and beta-alanine (3-aminopropionic acid) that is catalyzed by 
alanine 2,3-aminomutase. 

[23] FIG. 2 is a pathway for 3-hydroxypropionate (3-HP) synthesis from alpha- 
alanine, via beta-alanine as an intermediate. 

[24] FIG. 3 is a 4036 bp expression vector (pCKl 10900-1 Bla) of the present 
invention comprising a P15A origin of replication (P15A ori), a lad repressor, a CAP 
binding site, a lac promoter O ac )i a T7 ribosomal binding site (T7gl0 RBS), and a 
chloramphenicol resistance gene (camR). 

[25] FIGS. 4A-4J in combination provide an alignment chart of the amino acid 
sequences of four parental polypeptides that were used to produce the AAM of the 
present invention. The parental polypeptides were non-naturally occurring and 
derived in part from the KLAM of Clostrisium stricUandii (SEQ ID NO: 53), 
Porphyromonas gingivalis (SEQ ID NO: 55), Fusobacterium nucleatum (SEQ ID 
NO: 57), and Bacillus subtilis (SEQ ID NO: 59), respectively. The sequences of two 
wild-type KAM are disclosed in SEQ ID NOS: 60 (P GI2529467_G8_AAB81 159.1 J 
and 61 (P_GI2634361_EMB_CAB 13 860.1 J. A consensus sequence is also provided 
as SEQ ID NO: 62). 

[26] The foregoing summary, as well as the following detailed description of 
certain embodiments of the present invention, will be better understood when read in 
conjunction with the appended drawings. For the purpose of illustrating the 
invention, there is shown in the drawings, certain embodiments. It should be 
understood, however, that the present invention is not limited to the arrangements and 
instrumentality shown in the attached drawings. 
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DETAILED DESCRIPTION OF THE INVENTION 
[27] The present invention has multiple aspects. In one aspect, the present 
invention is directed to a polypeptide having alanine 2,3-aminomutase (AAM) 
activity, preferably as measured by the assay of Example 8, and 

(a) having a polypeptide selected from the group consisting of SEQ ID NO: 2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48 and 51: 

(b) having an amino acid sequence which has at least 98% homology, with the amino 
acid sequence selected from the group consisting of SEQ ID NO: 2, 22, 28, 32, and 
36; 

(c) having an amino acid sequence which has at least 99% homology, with the amino 
acid sequence selected from the group consisting of SEQ ID NO: 4, 6, 8, 12, 16, 24, 
26,30, 34 and 40; 

(d) being a polypeptide encoded by a nucleic acid sequence which hybridizes under 
high stringency conditions with either (i) the nucleotide sequence of SEQ ID NO: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 41, 43, 45, 47 or 49; 
(ii) a subsequence of (i) of at least 1 00 nucleotides, or (iii) a complementary strand of 
(i) or (ii) (J. Sambrook, E. F. Fritsch, and T. Maniatis, 1989, Molecular Cloning, A 
Laboratory Manual, 2d edition, Cold Spring Harbor, N.Y.); or 

(e) being a variant of the polypeptide of (d) comprising a substitution, deletion, and/or 
insertion of one to six amino acids therefrom and having AAM activity from about 1 
to about 30 uM p-alanine produced /hour 1 cell OD at pH 7.0-7.6, 25°C. 

[28] Collectively, the polypeptides of (b) and (c) above are referred to herein as 
"homologous polypeptides." For purposes of the present invention, the degree of 
homology between two amino acid, sequences is expressed as "percent homology," 
"percent identity," "% identity," "percent identical," and "% identical" are used 
interchangeably herein to refer to the percent amino acid sequence identity that is 
obtained by ClustalW analysis (version W 1.8 available from European 
Bioinformatics Institute, Cambridge, UK), counting the number of identical matches 
in the alignment and dividing such number of identical matches by the length of the 
reference sequence, and using the following default ClustalW parameters to achieve 
slow/accurate pairwise optimal alignments - Gap Open Penalty: 10; Gap Extension 
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Penalty:0.10; Protein weight matrix: Gonnet series; DNA weight matrix: IUB; 
Toggle Slow/Fast pairwise alignments = SLOW or FULL Alignment. 
[29] AAM polypeptides are sensitive to oxygen and are preferably maintained and 
used in an oxygen deficient environment. If the AAM polypeptide becomes 
inactivated due to exposure to oxygen, it can be activated by anaerobic incubation 
with a sulfhydryl compound for one hour at 37°C in accordance with the method 
described in Chirpich, et al., Journal Biol. Chem., 245(7): 1778-1789 (1970), which is 
incorporated herein by reference in its entirely. AAM polypeptides of the present 
invention are preferably utilized in whole cell form (i.e., as a whole cell transformed 
with an AAM polynucleotide that is used under conditions such that the encoded 
AAM polypeptide is expressed in the cell) or alternatively, both isolated and utilized 
under anoxic conditions. AAM polypeptides of the present invention may be isolated, 
and optionally purified, under anaerobic conditions (e.g., under a nitrogen 
atmosphere) in accordance with the method described in Petrovich, et al., Journal 
Biol. Chem., 266(12):7656-7660 (1991), which describes the isolation and 
purification of lysine-2,3-aminomutase and which is incorporated herein by reference 
in its entirety. As used herein, the term "anoxic" refers to oxygen deficient. The 
AAM polypeptides in whole cell form or as isolated enzymes may be lyophilized. In 
yet another embodiment, the present invention is directed to a composition 
comprising an AAM polypeptide as described herein (e.g., in whole cell form or as an 
isolated polypeptide) and a suitable carrier, typically a buffer, more typically an 
aqueous buffer solution having a pH from about 6.0 to about 8.0. It is also within the 
scope of the present invention that the aqueous buffered composition be lyophilized to 
provide a composition in a lyophilized form, wherein the composition is reconstituted 
by the addition of an aqueous based composition. 

[30] In one embodiment, the present invention is also directed to an AAM 
polypeptide as described herein in isolated and purified form. 

[31] In another embodiment, the present invention is directed to an AAM 
polypeptide as described herein in lyophilized form. Lyophilization is performed 
using standard lyophilization equipment. Typically, a solution containing the 
polypeptide is dispensed in an appropriate sized vial, frozen and placed under reduced 
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pressure to cause the water to evaporate, leaving the lyophilized (freeze-dried) 
polypeptide behind. Prior to use, the lyophilized polypeptide is reconstituted with 
distilled water or an appropriate buffer solution. 

[32] In yet another embodiment, the present invention is directed to a composition 
comprising an AAM polypeptide as described herein and a suitable carrier, typically a 
buffer solution, more typically an aqueous buffer solution having a pH between 6.0 
and 8.0. The composition may also be in a lyophilized form. 

[33] The novel AAM polypeptides of the present invention have significantly 
enhanced AAM activity relative to the wild-type KAM polypeptides from which they 
are ultimately derived. By significantly enhanced AAM activity is meant that the 
AAM polypeptide of the present invention has an AAM activity within the range of 
about 1 to about 32 ^M (3-alanine produced/hour 1 cell OD (units), preferably from 
about 10 to about 32 units, more preferably from about 20 to about 32 units; most 
preferably from about 25 to about 32 units. 

[34] Table 1 provides a chart showing the AAM activities of the various AAM 
polypeptides of the present invention, identified by their clone number and SEQ ID 
NO. In Table 1, the OD 60 onm is reported at harvest after 5 hours (t=5) of incubation. 
Table 1 also reports the total uM of ^-alanine produced after 5 hours per 1 cell OD. 
Finally, the last column of Table 1 reports the rate of p-alanine (uM) produced/hr II 
cell OD. 
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Table 1 





Harvest 

ODgoonm 
t= 5 


uM p-alanine 
produced at 
t=5/1 cell OD 


Rate of 
p-alanine(uM) 
produced /hr 
1 Cell OD 


34 


1.0 


159.7 


31.9 


10 


3.7 


31.7 


6.3 


38 


4.0 


54.9 


11.0 


20 


3.0 


73.4 


14.7 


14 


3.7 


33.5 


7.7 


22 


2.2 


4.8 


1.0 


42 


5.0 


17.5 


3.5 


26 


3.7 


23.9 


4.8 


18 


4.7 


19.3 


3.9 


44 


2.9 


64.4 


12.9 


51 


3.7 


35.0 


7.0 


36 


3.0 


29.8 


6.0 


48 


1.1 


110.1 


22.0 


12 


4.7 


17.8 


3.6 


4 


3.7 


22.4 


4.5 


16 


1.0 


136.0 


19.4 


24 


1.4 


94.7 


18.9 


46 


1.7 


107.6 


20.7 


28 


1.5 


148.0 


29.2 


40 


1.4 


14.6 


2.9 


32 


1.6 


93.2 


13.6 


2 


1.5 


87.5 


17.5 


30 


2.7 


72.6 


14.3 


6 


1.7 


125.7 


23.0 



[35] Preferred AAM polypeptides of the present invention have an amino acid 
sequences of SEQ ID NOs: 2, 6, 12, 16, 20, 24, 28, 30, 32, 34, 38, 44, 46 or 48; more 
preferably they have an amino acid sequence of SEQ ID NOs: 6, 12, 28, 34, 46 or 48; 
most preferably, they have an amino acid sequence of SEQ ID NOs: 28 or 34. 

[36] The ultimate grandparent molecule is the KAM of Bacillus subtilis, which had 
no detectible AAM activity. The DNA encoding this grandparent molecule was 
modified as described in WO 03/062173, entitled "Alanine 2,3-aminomutase," to 
produce a polypeptide having a detectible alanine 2,3-aminomutase activity. 
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[37] In the present application, the applicants utilized as one parent molecule a 
polynucleotide of SEQ ID NO: 58, which encoded the 471 residue polypeptide of 
SEQ ID NO: 59 and which exhibited an AAM activity of approximately .001 U/mg 
(units/ mg of cell mass). The molecule of SEQ ID NO: 59 differs from the wild-type 
B. subtilis KAM (SEQ ID NO: 60), which had no detectible AAM activity, by having 
the following four (4) amino acid substitutions: L103M, M136V, Y140H and D339H. 
[38] Other grandparent molecules utilized as starting materials in the present 
invention were the DNA sequences from other microorganisms (e.g., Porphyromonas 
gingivalis, Fusobacterium nucleatum, and Clostridium sticklandii) that encoded a 
KAM polypeptide. These DNA sequences were modified using standard techniques 
to introduce point substitutions that ultimately produced a KAM polypeptide that also 
had a detectible cross-reactivity with a-alanine. One such parent molecule that was 
derived from Porphyromonas gingivalis is the polynucleotide of SEQ ID NO: 54 
which encodes the 416 residue polypeptide of SEQ ID NO: 55. The parental 
polypeptide of SEQ ID NO: 55 differs from the wild-type Porphyromonas gringivalis 
KAM by having the following seven (7) amino acid substitutions: N19Y^ E30K, 
L53P, H85Q, II 92V, D331G, and M342T. Another such parent molecule that was 
derived from F. nucleatum is the polynucleotide of SEQ ID NO: 56 which encodes 
the 425 residue polypeptide of SEQ ID NO: 57. 

[39] Yet another parent polynucleotide was derived by modification, of the 
polynucleotide in C. stricklandii that encodes KAM. The resulting parental 
polynucleotide, which has a detectable cross-reactivity with a-alanine, is the 
polynucleotide of SEQ ID NO: 52 which encodes the 416 residue polypeptide of SEQ 
ID NO: 53. 

[40] The above described parental polypeptides of SEQ ED NOs: 53, 55, 57 and 58 
are compared in the alignment chart of FIG. 4. From the alignment chart, it can be 
seen that the KAMs from P. gingivalis, C. stricklandii, and F. nucleatum are truncated 
at the N-terminus and at the C-terminus relative to the KAM from B. subtilis, while 
between the four species, about 40% of the residue positions in the central portion of 
the KAM polypeptide are conserved. Based upon the truncated species in the 
aUgnment chart of FIG. 4, it can be inferred that the first 8 amino acid residues at the 
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N-terminus of SEQ ID NO: 58 and the last 40 residues at the C-terminus of SEQ ID 
NO: 58 are not necessary for KAM activity, or the AAM activity that is derived 
therefrom. In FIG. 4, there is also provided a consensus sequence. 
[41] The AAM polypeptide molecules of the present invention with their enhanced 
AAM activity were made by applying directed evolution techniques to the above- 
described parental molecules. These techniques are described in further detail herein. 
[42] In yet another aspect, the present invention is directed to AAM polypeptides 
that have enhanced activity in coupled reactions. 

[43] In another embodiment, the present invention is directed to an AAM a 
polypeptide encoded by a nucleic acid sequence which hybridizes under high 
stringency conditions with either (i) the nucleotide sequence of SEQ E) NO: 1, 3, 5, 
7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 41, 43, 45, 47 or 49; (ii) a 
subsequence of (i) of at least 100 nucleotides, or (iii) a complementary strand of (i) or 
(ii) (J. Sambrook, E. F. Fritsch, and T. Maniatis, 1989, Molecular Cloning, A 
Laboratory Manual, 2d edition, Cold Spring Harbor, N.Y.). For polynucleo-tides of at 
least 100 nucleotides in length, low to very high stringency conditions are defined as 
prehybridization and hybridization at42°C in 5x SSPE, 0.3% SDS, 200 ug/ml sheared 
and denatured salmon sperm DNA, and either 25% formamide for low stringencies, 
35% formamide for medium and medium-high stringencies, or 50% formamide for 
high and very high stringencies, following standard Southern blotting procedures. 
[44] For polynucleotides of at least 100 nucleotides in length, the carrier material is 
finally washed three times each for 15 minutes using 2x SSC, 0.2% SDS at least at 
50°C (low stringency), at least at 55°C (medium stringency), at least at 60°C. 
(medium-high stringency), at least at 65°C (high stringency), and at least at 70°C. 
(very high stringency). 

[45] In another embodiment, the present invention is directed to a variant of the 
polypeptide of (d) comprising a substitution, deletion, and/or insertion of one to six 
amino acids there-from and having AAM activity from about 1 to about 30 uM |3- 
alanine produced /hour 1 cell OD at pH 7.0-7.6, 25 °C, such as determined by the 
method of Example 8. Preferably, amino acid changes are of a minor natuire, that is 
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conservative amino acid substitutions that do not significantiy affect the folding 
and/or activity of the protein; small deletions, typically of one to six amino acids; 
small amino- or carboxyl-terminal extensions; a small linker peptide; or a small 
extension that facilitates purification by changing net charge or another function, such 
as a poly-histidine tract, an antigenic epitope or a binding domain. 
[46] Examples of conservative substitutions are wilhin the group of basic amino 
acids (arginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic 
acid), polar amino acids (glutamine and asparagine), hydrophobic amino acids 
(leucine, isoleucine and valine), aromatic amino acids (phenylalanine, tryptophan and 
tyrosine), and small amino acids (glycine, alanine, serine, threonine, prroline, cysteine 
and methionine). Amino acid substitutions, which do not generally alter the specific 
activity are known in the art and are described, for example, by H. Neurath and R. L. 
Hill, 1979, In, The Proteins, Academic Press, New York. The roost commonly 
occurring exchanges are Ala/Ser, Val/fle, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, 
Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/Ile, Leu/Val, 
Ala/Glu, and Asp/Gly as well as these in reverse. 

[47] In another embodiment, the present invention is directed to a fragment of (a), 
(b) or (c), as described above in the first paragraph of the Detailed Description, that 
has from about 1 to about 30 uM (3-alanine produced /hour 1 cell OD at pH 7.0-7.6, 
25°C, such as determined by the method of Example 8. By the term "fragment" is 
meant that the polypeptide has a deletion of 1 to 8 amino acid residues from the N- 
terminus or 1-40 residues from the C-terminus, or both. Preferably, tlie deletion is 1 
to 20 residues from the C-terminus, more preferably, the deletion is 1 to 10 residues 
from the C-terminus. 

Polynucleotides 

[48] In its second aspect, the present invention is directed to a polynucleotide 
sequence that encodes for an AAM polypeptide of the present invention. Given the 
degeneracy of the genetic code, the present invention is also directed to any 
polynucleotide that encodes for the above referenced AAM polypeptides of the 
present invention. In its second aspect, the present invention is directed to a 
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polynucleotide sequence that encodes for the correspondingly referenced AAM 
polypeptide. Given the degeneracy of the genetic code, the present invention is also 
directed to any polynucleotide that encodes for the above referenced AAM 
polypeptides of the present invention. In a preferred embodiment, the present 
invention is directed to certain specific polynucleotides of SEQ ID NOS: 1, 3, 5, 7, 9, 
11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47 and 49 that 
encode for the novel AAM polypeptides of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48 and 51, respectively. 
Preferred polynucleotides encode for a polypeptide of SEQ ID NO: 2, 6, 12, 16, 20, 
24, 28, 30, 32, 34, 38, 44, 46 or 48; more preferably they encode a polypeptide of 
SEQ ID NO: 6, 12, 28, 34, 46 or 48; most preferably, they have a polypeptide of 
sequence of SEQ ID NO: 28 or 34. 

[49] To make the improved AAM polypeptides of the present invention, one starts 
with one or more wild-type polynucleotides that encode a KAM polypeptide. The 
term "wild-type" polynucleotide means that the nucleic acid fragment does not 
comprise any mutations from the form isolated from nature. The term "wild-type" 
protein means that the protein will be active at a level of activity found in nature and 
typically will comprise the amino acid sequence as found in nature. Thus, the term 
"wild type" or "grand-parent sequence" indicates a starting or reference sequence 
prior to a manipulation of the invention. 

[50] Suitable sources of wild-type KAM as a starting material to be improved is 
readily identified by screening genomic libraries for the KAM activity. A particularly 
suitable source of KAM is the yodO gene of Bacillus sp. bacteria as found in nature. 
Using the published KAM gene sequences for B. subtilis (e.g., WO 03 0623173 A2), 
primers for amplification of the genes from their respective gene libraries were 
created using conventional techniques. One such technique for isolating the KAM of 
B. subtilis is disclosed in Chen et al, "A novel lysine 2,3-aminomutase encoded by 
theyodO gene of Bacillus subtilis: characterization on observation of organic radical 
intermediates," Biochem J. 348:539-549 (2000), which is incorporated herein by 
reference. 
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[51] The starting polynucleotides of SEQ ID NOs: 52, 54, 56 and 58 were obtained 
using the techniques discloses in WO 03 0623173 A2 which is incorporated herein by 
reference for the disclosure of those techniques as recited in the examples therein. 
Specifically, WO 03 0 623173 A2 discloses a B. subtilis wild-type lysine 2,3- 
aminomutase (KAM), and a mutated form thereof, which encodes an alanine 2,3- 
aminomutase (AAM). In addition, WO 03 0623173 A2 also discloses a P. gingivalis 
wild-type lysine 2,3-aminomutase (KAM) and a mutated form thereof, which encodes 
an alanine 2,3-aminomutase (AAM). 

[52] Beginning with the polynucleotide of SEQ ID NO: 58, a non-naturally 
occurring and mutated and/or evolved enzyme, having unknown AAM activity is 
generated using any one of the well-known mutagenesis or directed evolution 
methods. See, e.g., Ling, et al., "Approaches to DNA mutagenesis: an overview," 
Anal. Biochem. . 254(2):157-78 (1997); Dale, et al, "Oligonucleotide-directed 
random mutagenesis using the phosphorothioate method," Methods Mol. Biol. , 
57:369-74 (1996); Smith, "In vitro mutagenesis," Ann. Rev. Genet., 19:423-462 

(1985) ; Botstein, et al., " Strategies and applications of in vitro mutagenesis," Science , 
229:1193-1201 (1985); Carter, "Site-directed mutagenesis," Biochem. J. , 237:1-7 

(1986) ; Kramer, et al., "Point Mismatch Repair," Cell, 38:879-887 (1984); Wells, et 
al., "Cassette mutagenesis: an efficient method for generation of multiple mutations at 
denned sites," Gene , 34:315-323 (1985); Minshull, et al., "Protein evolution by 
molecular breeding," Current Opinion in Chemical Biology . 3:284-290 (1999); 
Christians, et al., "Directed evolution of thymidine kinase for AZT phosphorylation 
using DNA family shuffling," Nature Biotechnology . 17:259-264 (1999); Crameri, et 
al., "DNA shuffling of a family of genes from diverse species accelerates directed 
evolution," Nature , 391:288-291; Crameri, et al., "Molecular evolution of an arsenate 
detoxification pathway by DNA shuffling," Nature Biotechnology , 15:436-438 
(1997); Zhang, et al., "Directed evolution of an effective fucosidase from a 
galactosidase by DNA shuffling and screening," Proceedings of the National 
Academy of Sciences. U.S.A.. 94:45-4-4509; Crameri, et al., "Improved green 
fluorescent protein by molecular evolution using DNA shuffling," Nature 
Biotechnology < 14:315-319 (1996); Stemmer, "Rapid evolution of a protein in vitro 
by DNA shuffling," Nature , 370:389-391 (1994); Stemmer, "DNA shuffling by 
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random fragmentation and reassembly: In vitro recombination for molecular 
evolution," Proceedings of the National Academy of Sciences, "U.S.A., 91:10747- 
10751 (1994); WO 95/22625; WO 97/0078; WO 97/35966; WO 98/27230; WO 
00/42651; WO 01/75767 and U.S. Pat. 6,537,746 which issued to Arnold, et al. on 
March 25, 2003 and is entitled "Method for creating polynucleotide and polypeptide 
sequences." 

[53] Any of these methods can be applied to generate AAM polynucleotides. To 
maximize any diversity, several of the above-described techniques can be used 
sequentially. Typically, a library of shuffled polynucleotides is created by one 
mutagenic or evolutionary technique and their expression products are screened to 
find the polypeptides having the highest AAM activity. Then, a second mutagenic or 
evolutionary technique is applied to polynucleotides encoding the most active 
polypeptides to create a second library, which in turn is screened for AAM activity by 
the same technique. The process of mutating and screening can be repeated as many 
times as needed, including the insertion of point mutations, to arrive at a 
polynucleotide that encodes a polypeptide with the desired activity, thermostability, or 
cofactor preference. 

[54] Alternatively, polynucleotides and oligonucleotides of the invention can be 
prepared by standard solid-phase methods, according to known synthetic methods. 
Typically, fragments of up to about 100 bases are individually synthesized, then 
joined (e.g., by enzymatic or chemical litigation methods, or polymerase mediated 
methods) to form essentially any desired continuous sequence. For example, 
polynucleotides and oligonucleotides of the invention can be prepared by chemical 
synthesis using, e.g., the classical phosphoramidite method described by Beaucage et 
al. (1981) Tetrahedron Letters 22:1859-69, or the method described by Matthes et al. 
(1984) EMBO J. 3:801-05, e.g., as it is typically practiced in automated synthetic 
methods. According to the phosphoramidite method, oligonucleotides are 
synthesized, e.g., in an automatic DNA synthesizer, purified, annealed, ligated and 
cloned in appropriate vectors. 

[55] In addition, essentially any nucleic acid can be custom ordered from any of a 
variety of commercial sources, such as The Midland Certified Reagent Company, 
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Midland, TX, The Great American Gene Company, Ramona, CA, ExpressGen Inc., 
Chicago, IL, Operon Technologies Inc., Alameda, CA, all of which have internet web 
sites, and many others. Similarly, peptides and antibodies can be custom ordered 
from any of a variety of sources, such as PeptidoGenic, HTI Bio-products, Inc., BMA 
Biomedicals Ltd. (U.K.), Bio.Synthesis, Inc., and many others. 

[56] Polynucleotides may also be synthesized by well-known techniques as 
described in the technical literature. See, e.g., Carruthers et al, Cold Spring Harbor 
Symp. Quant. Biol. 47:411-418 (1982), and Adams et al, J. Am. Chem. Soc. 105:661 
(1983). Double stranded DNA fragments may then be obtained either by synthesizing 
the complementary strand and annealing the strands together under appropriate 
conditions, or by adding the complementary strand using DNA polymerase with an 
appropriate primer sequence. 

[57] General texts which describe molecular biological techniques useful herein, 
including mutagenesis, include Berger and Kimmel, Guide to M olecular Cloning 
Techniques. Methods in Enzvmology, volume 152 Academic Press, Inc., San Diego, 
CA ("Berger"); Sambrook et al, Molecular Cloning - A Laboratory Manual (2nd Ed.), 
volumes 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989 
("Sambrook"); and Current Protocols in Molecular Biology. F.M. AusubeZ et al., eds., 
Current Protocols, a joint venture between Greene Publishing Associates, Inc. and 
John Wiley & Sons, Inc. (supplemented through 2000) ("Ausubel")). Examples of 
techniques sufficient to direct persons of skill through in vitro amplification methods, 
including the polymerase chain reaction (PCR) the ligase chain reaction (LCR), Q0- 
replicase amplification and other RNA polymerase mediated techniques (e.g., 
NASBA) are found in Berger, Sambrook, and Ausubel, as well as Mullis et al, (1987) 
U.S. Patent No. 4,683,202; PCR Protocols A Guided to Methods and Applications 
(Innis et al, eds.) Academic Press Inc. San Diego, CA (1990); Arnheim & Levinson 
(October 1, 1990) Chemical and Engineering News 36-47; The Journa l Of NTH 
Research (1991) 3:81-94; Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173; 
Guatelli et al (1990) Proc. Natl. Acad. Sci. USA 87:1874; Lomell et al (1989) J. 
Clin. Chem. 35:1826; Landegren et al, (1988) Science 241:1077-1080; Van Brunt 
(1990) Biotechnology 8:291-294; Wu and Wallace, (1989) Gene 4:560; Barringer et 
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al. (1990) Gene 89:117, and Sooknanan and Malek (1995) Biotechnology 13:563- 
564. Improved methods of cloning in vitro amplified nucleic acids aTe described in 
Wallace et ah, U.S. Pat. No. 5,426,039. Improved methods of amplifying large 
nucleic acids by PCR are summarized in Cheng et ah (1994) Nature 369:684-685 and 
the references therein, in which PCR amplicons of up to 40kb are generated. One of 
skill will appreciate that essentially any RNA can be converted into a double stranded 
DNA suitable for restriction digestion, PCR expansion and sequencing using reverse 
transcriptase and a polymerase. See, Ausubel, Sambrook and Berger, all supra. 
[58] It will be appreciated by those skilled in the art due to the degeneracy of the 
genetic code, a multitude of nucleotide sequences encoding AAM polypeptides of the 
invention may be produced, some of which bear substantial identity to the nucleic 
acid sequences explicitly disclosed herein. It is also within the scope of the present 
invention that the polynucleotides encoding the AAM polypeptides of the present 
invention may be codon optimized for optimal production from the host organism 
selected for expression. Those having ordinary skill in the art will recognize that 
tables and other references providing codon preference information for a wide range 
of organisms are readily available. See e.g., Henaut and Danchin, "Escherichia coli 
and Salmonella," Neidhardt, et al. Eds., ASM Press, Washington D.C., p. 2047-2066 
(1996). 

[59] It is to be noted that expression in E. coli is different than in other organisms. 
For example, in the present invention, the codon (tgg) encodes Trp (W) for residue 
position 31 in the parent polypeptide of SEQ ID NO: 59. However, the corresponding 
codon for residue position 3 1 is "tga" in each of the progeny polynucleotides of SEQ 
IDNOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 
45, and 47 encoding for the AAM polypeptides of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 
16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, and 48, respectively. One 
skilled in the art recognizes that the codon "tga" is usually a stop (nonsense) codon. 
However, in the present expression system used in the ApanD E. coli strain, and under 
the selection conditions imposed, this codon is read through by the E. coli as a sense 
codon and is expressed, presumably as Trp (W). Others have reported that "tga" is 
the weakest stop codon for E. coli and that it is often read through as a sense codon 
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for Trp (W) in high expression. See eg., Parker, J., "Errors and Alternatives in 
Reading the universal Genetic Code," Microbiological Reviews, 53(3): 273-298 
(1989); Roth, J., "UGA Nonsense Mutations in Salmonella typhimurium," J. of 
Bacteriology, 102(2):467-475 (1970); and McBeath, G. and Kast, P., "UGA Read- 
Through Artifacts — When Popular Gene Expression Systems Need a Patch," 
BioTechniques, 24:789-794 (May 1998), which are incorporated herein by reference. 
Hence for expression in non-is. coli systems, it would be advantageous to alter the 
codon (tga) at residue position 31 to "tgg" which is the universal sense codon for Trp 
(W). 

[60] In SEQ ID NO: 49, the codon encoding for residue 72 is "tag" which is read as 
a stop codon. However, two fragments are produced. The first fragment, having 
residues 1-71 of SEQ ID NO: 50, does not have any detectable AAM activity. The 
second fragment that is produced begins with residue 73 (Val) instead of the usual 
Met. This second fragment has 399 residues (SEQ ED NO: 51) and does have 
significant AAM activity (see Table 2) based upon the assay of Example 8. Thus, the 
first 72 residues at the N-terminus of the AAM polypeptide (based upon the 
consensus sequence or the parental KAM sequence from B. subtilis) are not 
absolutely necessary for AAM activity. 

[61] In the present case, several round No. 1 libraries were created by applying a 
variety of mutagenic techniques to the polynucleotides of SEQ ID NOs: 52, 54, 56 
and 58. 

[62] In its third aspect, the present invention is directed to an expression vector and 
to a host cell comprising a polynucleotide of the present invention operatively linked 
to a control sequence. To obtain expression of the -variant gene encoding an AAM 
polypeptide, the variant gene was first operatively linked to one or more heterologous 
regulatory sequences that control gene expression to create a nucleic acid construct, 
such as an expression vector or expression cassette. Thereafter, the resulting nucleic 
acid construct, such as an expression vector or expression cassette, was inserted into 
an appropriate host cell for ultimate expression of the AAM polypeptide encoded by 
the shuffled gene. A "nucleic acid construct" is defined herein as a nucleic acid 
molecule, either single-or double-stranded, which, is isolated from a naturally 
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occurring gene or which has been modified to contain segments of nucleic acid 
combined and juxtaposed in a manner that would not otherwise exist in nature. Thus, 
in one aspect, the present invention is directed to a nucleic acid construct comprising 
a polynucleotide encoding an AAM polypeptide of the present invention. 

[63] The term "nucleic acid construct" is synonymous with the term "expression 
cassette" when the nucleic acid construct contains all the control sequences required 
for expression of a coding sequence of the present invention. The term "coding 
sequence" is defined herein as a nucleic acid sequence, which directly specifies the 
amino acid sequence of its protein product. A coding sequence can include, but is not 
limited to, DNA, cDNA, and recombinant nucleic acid sequences. 
[64] An isolated polynucleotide encoding an AAM polypeptide of the present 
invention may be manipulated in a variety of ways to provide for expression of the 
polypeptide. Manipulation of the isolated polynucleotide prior to its insertion into a 
vector may be desirable or necessary depending on the expression vector. The 
techniques for modifying polynucleotides and nucleic acid sequences utilizing 
recombinant DNA methods are well known in the art. 

[65] The term "control sequence" is defined herein to include all components, 
which are necessary or advantageous for the expression of a polypeptide of the 
present invention. Each control sequence may be native or foreign to the nucleic acid 
sequence encoding the polypeptide. Such control sequences include, but are not 
limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal 
peptide sequence, and transcription terminator. At a minimum, the control sequences 
include a promoter, and transcriptional and translational stop signals. The control 
sequences may be provided with linkers for the purpose of introducing specific 
restriction sites facilitating ligation of the control sequences with the coding region of 
the nucleic acid sequence encoding a polypeptide. 

[66] The term "operably linked" is defined herein as a configuration in which a 
control sequence is appropriately placed at a position relative to the coding sequence 
of the DNA sequence such that the control sequence directs the expression of a 
polypeptide. 
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[67] The control sequence may be an appropriate promoter sequence. The 
"promoter sequence" is a relatively short nucleic acid sequence that is recognized by a 
host cell for expression of the longer coding region that follows. The promoter 
sequence contains transcriptional control sequences, which mediate the expression of 
the polypeptide. The promoter may be any nucleic acid sequence which shows 
transcriptional activity in the host cell of choice including mutant, truncated, and 
hybrid promoters, and may be obtained from genes encoding extracellular or 
intracellular polypeptides either homologous or heterologous to the host cell. 
[68] For bacterial host cells, suitable promoters for directing the trans cription of the 
nucleic acid constructs of the present invention, include the promoters obtained from 
the E. coli lac operon , Streptomyces coelicolor agarase gene (dagA), Bacillus subtilis 
levansucrase gene (sacB), Bacillus licheniformis alpha-amylase gene (armyL), Bacillus 
stearothermophilus maltogenic amylase gene (amyM), Bacillus amyloliquefaciens 
alpha-amylase gene (amyQ), Bacillus licheniformis penicillinase gene (penP), 
Bacillus subtilis xylA and xylB genes, and prokaryoric beta-lactamase gene (Villa- 
Kamaroff et al., 1978, Proceedings of the National Academy of Sciences USA 75: 
3727-3731), as well as the tac promoter (DeBoer et al., 1983, Proceedings of the 
National Academy of Sciences USA 80: 21-25). Further promoters are described in 
"Useful proteins from recombinant bacteria" in Scientific American, 1 980, 242: 74- 
94; and in Sambrook et al, 1989, supra. 

[69] For filamentous fungal host cells, suitable promoters for directing the 
transcription of the nucleic acid constructs of the present invention include promoters 
obtained from the genes for Aspergillus oryzae TAKA amylase, Rhizomncor miehei 
aspartic proteinase, Aspergillus niger neutral alpha-amylase, Aspergillus niger acid 
stable alpha-amylase, Aspergillus niger at Aspergillus awamori glucoarnylase (glaA), 
Rhizomucor miehei lipase, Aspergillus oryzae alkaline protease, Aspergillus oryzae 
triose phosphate isomerase, Aspergillus nidulans acetamidase, and Fusarium 
oxysporum trypsin-like protease (WO 96/00787), as well as the NA2-tpi promoter (a 
hybrid of the promoters from the genes for Aspergillus niger neutral alpha-amylase 
and Aspergillus oryzae triose phosphate isomerase), and mutant, truncated, and hybrid 
promoters thereof. 
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[70] In a yeast host, useful promoters are obtained from the genes for 
Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae galactokinase 
(GAL1), Saccharomyces cerevisiae alcohol dehydrogenase/gIyceraldehyde-3- 
phosphate dehydrogenase (ADH2/GAP), and Saccharomyces cerevisiae 3- 
phosphoglycerate kinase. Other useful promoters for yeast host cells are described by 
Romanos et al„ 1992, Yeast 8:423-488. 

[71] The control sequence may also be a suitable transcription terminator sequence, 
a sequence recognized by a host cell to terminate transcription. The terminator 
sequence is operably linked to the 3' terminus of the nucleic acid sequence encoding 
the polypeptide. Any terminator, which is functional in the host cell of choice, may be 
used in the present invention. 

[72] Preferred terminators for filamentous fungal host cells are obtained from the 
genes for Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, 
Aspergillus nidulans anthranilate synthase, Aspergillus niger alpha-glucosidase, and 
Fusarium oxysporum trypsin-like protease. 

[73] Preferred terminators for yeast host cells are obtained from the genes for 
Saccharomyces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CTYC1), 
and Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other 
useful terminators for yeast host cells are described by Romanos et al., 1992, supra. 

[74] The control sequence may also be a suitable leader sequence, a nontranslated 
region of an mRNA which is important for translation by the host cell. The leader 
sequence is operably linked to the 5* terminus of the nucleic acid sequence encoding 
the polypeptide. Any leader sequence that is functional in the host cell of choice may 
be used in the present invention. Preferred leaders for filamentous fungal host cells 
are obtained from the genes for Aspergillus oryzae TAKA amylase and Aspergillus 
nidulans triose phosphate isomerase. Suitable leaders for yeast host cells are obtained 
from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces 
cerevisiae 3-phosphoglycerate kinase, Saccharomyces cerevisiae alpha-factor, and 
Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate 
dehydrogenase (ADH2/GAP). 
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[75] The control sequence may also be a polyadenylation sequence, a sequence 
operably linked to the 3' terminus of the nucleic acid sequence and which, when 
transcribed, is recognized by the host cell as a signal to add polyadenosine residues to 
transcribed mRNA. Any polyadenylation sequence that is functional in the host cell 
of choice may be used in the present invention. Preferred polyadenylation sequences 
for filamentous fungal host cells are obtained from the genes for Aspergillus oryzae 
TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate 
synthase, Fusarium oxysporum trypsin-like protease, and Aspergillus niger alpha- 
glucosidase. Useful polyadenylation sequences for yeast host cells are described by 
Guo and Sherman, 1995, Molecular Cellular Biology 15: 5983-5990. 
[76] The control sequence may also be a signal peptide coding region that codes for 
an amino acid sequence linked to the amino terminus of a polypeptide and directs the 
encoded polypeptide into the cell's secretory pathway. The 5' end of the coding 
sequence of the nucleic acid sequence may inherently contain a signal peptide coding 
region naturally linked in translation reading frame with the segment of the coding 
region that encodes the secreted polypeptide. Alternatively, the 5' end of the coding 
sequence may contain a signal peptide coding region that is foreign to the coding 
sequence. The foreign signal peptide coding region may be required where the coding 
sequence does not naturally contain a signal peptide coding region. 
[77] Alternatively, the foreign signal peptide coding region may simply replace the 
natural signal peptide coding region in order to enhance secretion of the polypeptide. 
However, any signal peptide coding region that directs the expressed polypeptide into 
the secretory pathway of a host cell of choice may be used in the present invention. 
[78] Effective signal peptide coding regions for bacterial host cells are the signal 
peptide coding regions obtained from the genes for Bacillus NC1B 11837 maltogenic 
amylase, Bacillus stearothermophilus alpha-amylase, Bacillus licheniformis subtilisin, 
Bacillus licheniformis beta-lactamase, Bacillus stearothermophilus neutral proteases 
(nprT, nprS, nprM), and Bacillus subtilis prsA. Further signal peptides are described 
by Simonen and Palva, 1993, Microbiological Reviews 57: 109-137. 
[79] Effective signal peptide coding regions for filamentous fungal host cells are 
the signal peptide coding regions obtained from the genes for Aspergillus oryzae 
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TAKA amylase, Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, 
Rhizomucor miehei aspartic proteinase, Humicola insolens cellulase, and Humicola 
lanuginosa lipase. 

[80] Useful signal peptides for yeast host cells are obtained from the genes for 
Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. 
Other useful signal peptide coding regions are described by Romanos et al, 1992, 
supra. 

[81] The control sequence may also be a propeptide coding region that codes for an 
amino acid sequence positioned at the amino terminus of a polypeptide. The resultant 
polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some 
cases). A propolypeptide is generally inactive and can be converted to a mature active 
polypeptide by catalytic or autocatalytic cleavage of the propeptide from the 
propolypeptide. The propeptide coding region may be obtained from the genes for 
Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease (nprT), 
Saccharomyces cerevisiae alpha-factor, Rhizomucor miehei aspartic proteinase, and 
Myceliophthora thermophila lactase (WO 95/33836). 

[82] Where both signal peptide and propeptide regions are present at the amino 
terminus of a polypeptide, the propeptide region is positioned next to the amino 
terminus of a polypeptide and the signal peptide region is positioned next to the amino 
terminus of the propeptide region. 

[83] It may also be desirable to add regulatory sequences, which allow the 
regulation of the expression of the polypeptide relative to the growth of the host cell. 
Examples of regulatory systems are those which cause the expression of the gene to 
be turned on or off in response to a chemical or physical stimulus, including the 
presence of a regulatory compound. In prokaryotic host cells, suitable regulatory 
sequences include the lac, tac, and trp operator systems. In yeast host cells, suitable 
regulatory systems include the ADH2 system or GAL1 system. In filamentous fungi, 
suitable regulatory sequences include the TAKA alpha-amylase promoter, Aspergillus 
niger glucoamylase promoter, and Aspergillus oryzae glucoamylase promoter. 
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[84] Other examples of regulatory sequences are those which allow for gene 
amplification. In eukaryotic systems, these include the dihydrofolate reductase gene, 
which is amplified in the presence of methotrexate, and Hie metallothionein genes, 
which are amplified with heavy metals. In these cases, the nucleic acid sequence 
encoding the AAM polypeptide of the present invention would be operably linked 
with the regulatory sequence. 

Expression Vectors 

[85] In another aspect, the present invention is also directed to a recombinant 
expression vector comprising a polynucleotide of the present invention (which 
encodes an AAM polypeptide of the present invention), and one or more expression 
regulating regions. An expression regulating region includes a promoter, a 
terminator, a replication origin, etc., depending on the type of hosts into which they 
are to be introduced. The various nucleic acid and control sequences described above 
may be joined together to produce a recombinant expression vector which may 
include one or more convenient restriction sites to allow for insertion or substitution 
of the nucleic acid sequence encoding the polypeptide at such sites. Alternatively, the 
nucleic acid sequence of the present invention may be expressed by inserting the 
nucleic acid sequence or a nucleic acid construct comprising the sequence into an 
appropriate vector for expression. In creating the expression vector, the coding 
sequence is located in the vector so that the coding sequence is operably linked with 
the appropriate control sequences for expression. 

[86] The recombinant expression vector may be any vector (e.g., a plasmid or 
virus), which can be conveniently subjected to recombinant DNA procedures and can 
bring about the expression of the polynucleotide' sequence. The choice of the vector 
will typically depend on the compatibility of the vector with the host cell into which 
the vector is to be introduced. The vectors may be linear or closed circular plasmids. 
[87] The expression vector may be an autonomously Teplicating vector, i.e., a 
vector that, exists as an extrachromosomal entity, the replication of which is 
independent of chromosomal replication, e.g., a plasmid, an extrachromosomal 
element, a minichromosome, or an artificial chromosome. The vector may contain 
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any means for assuring self-replication. Alternatively, the vector may be one which, 
when introduced into the host cell, is integrated into the genome and replicated 
together with the chromosome(s) into which it has been integrated. Furthermore, a 
single vector or plasmid or two or more vectors or plasmids which, together contain 
the total DNA to be introduced into the genome of the host cell, or a transposon may 
be used. 

[88] The expression vector of the present invention preferably contains one or more 
selectable markers, which permit easy selection of transformed cells. A selectable 
marker is a gene the product of which provides for biocide or viral resistance, 
resistance to heavy metals, prototrophy to autotrophs, and the like. Examples of 
bacterial selectable markers are the dal genes from Bacillus subtilis or Bacillus 
licheniformis, or markers, which confer antibiotic resistance such as ampicillin, 
kanamycin, chloramphenicol (Example 1) or tetracycline resistance. Suitable markers 
for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, TKP1, and URA3. 
[89] Selectable markers for use in a filamentous fungal host cell include, but are 
not limited to, amdS (acetamidase), argB (ornithine carbamoyltransferase), bar 
(phosphinothricin acetyltransferase), hph (hygromycin phosphotransferase), niaD 
(nitrate reductase), pyrG (ororidine-5'-phosphate decarboxylase), (sulfate 
adenyltransferase), and trpC (anthranilate synthase), as well as equivalents thereof. 
Preferred for use in an Aspergillus cell are the amdS and pyrG genes of Aspergillus 
nidulans ox Aspergillus oryzae and the bar gene of Streptomyces hygroscopicus. 

[90] The vectors of the present invention preferably contain an element(s) that 
permits integration of the vector into the host cell's genome or autonomous replication 
of the vector in the cell independent of the genome. For integration, into the host cell 
genome, the vector may rely on the nucleic acid sequence encoding the polypeptide or 
any other element of the vector for integration of the vector into the genome by 
homologous or nonhomologous recombination. 

[91] Alternatively, the vector may contain additional nucleic acid sequences for 
directing integration by homologous recombination into the genome of the host cell. 
The additional nucleic acid sequences enable the vector to be integrated into the host 
cell genome at a precise location(s) in the cbxomosome(s). To increase the likelihood 



WO 2006/047589 



PCI7US2005/038552 



-29- 

of integration at a precise location, the integrational elements should preferably 
contain a sufficient number of nucleic acids, such as 100 to 10,000 base pairs, 
preferably 400 to 10,000 base pairs, and most preferably 800 to 10,000 base pairs, 
which are highly homologous with the corresponding target sequence to enhance th.e 
probability of homologous recombination. The integrational elements may be any 
sequence that is homologous with the target sequence in the genome of the host cell. 
Furthermore, the integrational elements may be non-encoding or encoding nucleic 
acid sequences. On the other hand, the vector may be integrated into the genome of 
the host cell by non-homologous recombination. 

[92] For autonomous replication, the vector may further comprise an origin of 
replication enabling the vector to replicate autonomously in the host cell in question. 
Examples of bacterial origins of replication are P15A, pSClOl, pMBl and ColEl. 
Origins of replication of plasmids pBR322 (which has a pMBl origin of replication) 
pUC19 (which has a ColEl origin of replication), pACYC177 and pACYC184 (whiclh 
have a P15A origin of replication), permit replication in E. coli; origins of replicatioai 
for plasmids pUBl 10, pE194, pTA1060, or pAM.beta.l permit replication in Bacillus. 
Examples of origins of replication for use in a yeast host cell are the 2 micron origin 
of replication, ARS1, ARS4, the combination of ARS1 and CEN3, and th_e 
combination of ARS4 and CEN6. The origin of replication may be one having a 
mutation which makes its functioning temperature-sensitive in the host cell (see, e.g-., 
Ehrlich, 1978, Proceedings of the National Academy of Sciences USA 75: 1433). 

[93] More than one copy of a nucleic acid sequence of the present invention mary 
be inserted into the host cell to increase production of the gene product. An increas e 
in the copy number of the nucleic acid sequence can be obtained by integrating at 
least one additional copy of the sequence into the host cell genome or by including aan 
amplifiable selectable marker gene with the nucleic acid sequence where cells 
containing amplified copies of the selectable marker gene, and thereby additional 
copies of the nucleic acid sequence, can be selected for by cultivating the cells in th_e 
presence of the appropriate selectable agent. 

[94] The procedures used to ligate the elements described above to construct th.e 
recombinant nucleic acid construct and expression vectors of the present invention are 
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well known to one skilled in the art (see, e.g., J. Sambrooik, E. F. Fritsch, and T. 
Maniatis, 1989, Molecular Cloning, A Laboratory Manual, 2d edition, Cold Spring 
Harbor, N.Y.). 

[95] Many of the expression vectors for use in the present invention are 
commercially available. Suitable commercial expression vectors include 
p3xFLAGTM™ expression vectors from Sigma-Aldrich Chemicals, St. Louis MO., 
which includes a CMV promoter and hGH polyadenylation site for expression in 
mammalian host cells and a pBR322 origin of replication and ampicillin resistance 
markers for amplification in E. coli. Other suitable expression vectors are 
pBluescripffl SK(-) and pBK-CMV, which are commercially available from 
Stratagene, LaJolla CA, and plasmids thai are derived from pBR322 (Gibco BRL), 
pUC (Gibco BRL), pREP4, pCEP4 (Invitrogene) or pPoly (Lathe et al., 1987, Gene 
57, 193-201). 

[96] Example 6 herein discloses the use of the expression vector pCKl 10900-1 Bla, 
as shown in the vector map of FIG. 3. 

Host Cells 

[97] Host cells for use in expressing the expression vectors of the present invention 
include but are not limited to, bacterial cells, such as E. coli, Streptomyces and 
Salmonella typhimurium cells; fungal cells, such as yeast cells (e.g., Saccharomyces 
cerevisiae or Pichia pastoris (ATCC Accession No. 201178)); insect cells such as 
Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS, 293, and 
Bowes melanoma cells; and plant cells. Appropriate culture mediums and conditions 
for the above-described host cells are well known in the art. 

[98] By way of example, Escherichia coli W3110 was transformed by an 
expression vector for expressing the shuffled genes of the present invention. The 
expression vector was created by operatively linking a variant gene of the present 
invention to the lac promoter under control of the lacl repressor gene. The expression 
vector also contained the P15A origin of replication and the chloroamphenicol 
resistance gene. The transformed Escherichia coli W3110 was cultured under 
appropriate culture medium containing chloramphenicol such that only transformed E 
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coli cells that expressed the expression vector survived. See e.g., Example 1. 
Purification 

[99] Once the AAM polypeptides were expressed by the variant genes in E. coli, 
the polypeptides were purified from the cells and or the culture medium using any one 
or more of the well known techniques for protein purification, including lysozyme 
treatment, sonication, filtration, salting, ultra-centrifugation, affinity chromatography, 
and the like under strict anoxic conditions. Suitable solutions for high efficiency 
extraction of proteins from bacteria, such as E. coli, axe commercially available under 
the trade name CelLytic B™ from Sigma-Aldrich of St. Louis MO. A suitable 
process for purifying AAM polypeptides sufficiently from cell lysate for applications 
in a chemical process is disclosed in the references: Chirpich, T. P. et al., J. Biol. 
Chem., 1970, 245, 1778-1789; and Petrovich, R. M. et al, J. Biol. Chem., 1991, 266, 
7656-7660, both of which are incorporated herein by reference. 

Screening 

[100] After several rounds of directed evolution were performed, the resulting 
libraries of exemplary AAM polypeptides were screened. Screening for transformed 
cells that express a polypeptide having AAM activity is, in general, a two-step 
process. First, one physically separates the cells and then determines which cells do 
and do not possess a desired property. Selection is a form of screening in which 
identification and physical separation are achieved simultaneously by expression of a 
selection marker, which, in some genetic circumstances, allows cells expressing the 
marker to survive while other cells die (or vice versa). Exemplary screening markers 
include luciferase, 3-galactosidase, and green fluorescent protein. Selection markers 
include drug and toxin resistance genes, such as resistance to chloramphenicol, 
ampicillin and the like. Although spontaneous selection can and does occur in the 
course of natural evolution, in the present methods selection is performed by man. 
[101] The AAM polynucleotides generated by the mutagenesis or directed evolution 
method are screened in accordance with the protocol described in Example 8 to 
identify those having enhanced activity that are suitable for inclusion as an improved 
AAM polypeptide of the present invention. In the process of Example 8, the 
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screening of clones from the expression libraries for enhanced AAM activity was 
performed by measuring the conversion of a-alanine to p-alanine using liquid 
chromatography and mass spectrometry. Based upon the screening results, the AAM 
polypeptides of the present invention axe listed in Table 2 below along with their 
residue changes and enhanced AAM activity relative to one parental AAM 
polypeptide, i.e., the polypeptide ofSEQ ID NO: 59. 



Table 2 



Seq. ID No. 


Residue changes relative to 
parent SEQ ID NO: 59 


Rate of 
P-alanine(uM) 
produced /hr 
1 Cell OD 


34 


T1T7T T^>T7Ayr (^^rtOD "MftQT 

11 //L, IZZ/M, CjoUoK, JL4U5.L, 
F416S.D447G 


31.9 


10 


129oV, (jjUoK, r^iob, 1J44 /Lj 




38 


D125N,I177L,T210S, 


11.0 


20 


K2E, I307L, 


14.7 


14 


K13E, L17R, LI 97P, I200T, 
M281V,F310S,F416S, D447G 


7.7 


22 


Y72H, L118P, R145L, I220V, 
F240L, S250P, R311C, F416S, 
D447G 


1.0 


42 


K19R, T99S, G308R, F416S, 
D447G 


3.5 


26 


N80K, G308R, E3 19G, R325G, 
Q350R 


4.8 


18 


Q32R, S74P, SI 13T, LI 18P, 
G308R, F416S, D447G 


3.9 


44 


D79E, G308R, S329P, F393S, 
F414S, D445G, L453S, 


12.9 


51 (fragment) 


A73V, G308R,Y331N, F416S, 
D447G 


7.0 


36 


D79E, S93P, N132D, M281L 
G308R,Y331N, F416S, D447G 


6.0 


48 


K2E, M76I, D79E, T131A, 
L203P, G308R, Y331C, F416S, 
D447G 


22.0 


12 


R38G, C134G, C141R, L203P, 
I280T, G308R, F-416S, D447G 


3.6 


4 


2KE, I220V, N237D, G308R, 
D360G, K361R, F416S, D447G 


4.5 
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16 


K13E, L17R,L197P, HOOT, 
M281V, G308R, F310S, F416S, 
D447G 


19.4 


24 


E23D, L43S, D124G, Y137H, 
K156E, G308R, D411G, F416S, 
D447G 


18.9 


46 


W18R, M76I, D79E, V90A, 
M152T, I163T, S178P, V215G, 
G308R, V354A, F416S, D447G 


20.7 


28 


E22G, Y71C, S74P, H108R, 
D187G,'l244V, G308R, E396G, 
F416S,D447G,F454S 


29.2 


40 


Y137H, G308R, D411G, F416S, 
D422V, D447G 


2.9 


32 


H35R, D79E, K98T, T99S, 
N132S, S135P, E204G, K230R, 
G308R,F416S, D447G 


13.6 


2 


W235R, S250P, C254R, D276G, 
G308R, Y380C, I381T, F416S, 
K440E.D447G 


17.5 


30 


Q32R, N67S, H140R, G308R, 
F416S,D447G 


14.3 


6 


E24G, M96I.E109G, G308R, 
F416S.D447G 


23.0 


8 


G308R, S329P, F416S, D447G, 
L455S 


14.7 



[102] In Table 2 above, it is seen that the AAM polypeptides of the present 
invention have from 2 to 1 1 residue differences than their parent polypeptide of SEQ 
ID NO: 59, and very significant AAM activity as evidenced by the production of 0- 
alanine in the assay of Example 8. In comparison, (3-alanine was not detected for 
SEQ ID NO: 59 under the assay conditions used to test the AAM variants. However, 
some 0-alanine production for parental SEQ ID NO: 5S> was detected in a qualitative 
growth based complementation assay. 

[103] Referring to Table 2 above, two preferred residue changes for the AMM 
polypeptides of the present invention relative to the parental sequence of SEQ ID NO: 
59 are G308R and F416S. In those AAM polypeptides of the present invention that 
are at least 447 residues long, an additional preferred residue change is D447G 
relative to the parental sequence of SEQ ID NO: 59 . Additional suitable residue 
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changes are G308K, F416M and D447L, A, I or V. Thus, in one aspect, the present 
invention is directed to an AAM polypeptide having at least 5 amino acid residue 
changes, typically 5-1 1 residue changes, relative to SEQ ID NO: 59 or a truncated 
fragment thereof as taught herein, the residue changes including from 1 to 3 residue 
changes selected from the group consisting of G308R, G308K, F416S, F416M, 
D447G, D447L, D447A, D447I and D447V. 

[104] Based upon the AAM activity in Table 2, an especially preferred AAM 
polypeptide of the present invention is a polypeptide having 95% sequence homology 
with the polypeptide of SEQ ID NO: 34, more preferably 98% homology, most 
preferably 99% homology. 

[105] The parental polypeptides of SEQ ID NOs: 53, 55 and 57 demonstrate that the 
residues 1-8 at the N-terminus and residues 434-473 at the C-terminus are not 
necessary for KAM or AAM activity. Likewise, the polypeptide fragment of SEQ ID 
NO: 51, which is a 399 residue expression product, discloses that the first 72 amino 
acids at the N-terminus relative to the parental clone of SEQ ID NO: 59 are not 
necessary for AAM activity. (See Table 2) Thus, it is also within the scope of the 
present invention that the polypeptides described herein include fragments thereof that 
lack from 1 to 72 residues from their N-terminus relative to the parental sequence of 
SEQ ID NO: 59, typically from 1 to 40 residues, more typically from 1-20 residues, 
most typically from 1 to 11 residues. It is also within, the scope of the present 
invention that the above described N-terminal truncation be utilized in combination 
with a C-terminal truncation as described elsewhere herein. 

[106] Only a very few (< 0.5%) of the mutations to the parental B. subtilis KAM 
(SEQ ID NO: 59) backbone were found to be beneficial. Specifically, for every 1000 
clones screened, there occurred only 3-5 single point or double point mutations that 
were beneficial. In fact, some of the mutations were found to be detrimental. 
[107] The first of the following two sets of sequences provides the sequence of the 
wild type B. subtilis lysine 2,3-aminomutase (KAM) polypeptides of the prior art, as 
deposited (GI_2529467_GB_AAB81159.1_). This sequence (SEQ ID NO: 60) was 
not used as a parent sequence but is provided only for purposes of comparison. 
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MKNKWYKPKRHWKEIELWKDVPEEKWNDWLWQLTHT 

VRTLDDLKKVINLTEDEEEGVRISTKTIPLNITPYYASL 

MDPDNPRCP VRMQSVPLSEEMHKTKYDLEDPLHEDED 

SRVPGLTHRYPDRVLFLVTNQCSMYCRYCTRRRFSGQI 

GMGVPKKQLD A A I A YIRE TP E I RD C LI S G G D G L L IND Q I 

LEYILKELRSIPHLEVIRIGTRAPVVFPQRITDHLCEILK 

KYHPVWLNTHFNTSIEMTEESVEACEKLVNAG.VPVGN 

QAVVLAGINDSVPIMKKLMHDLVKIRVRPYYIYQCDLS 

EGIGHFRAPVSKGLEIIEGLRGHTSGYAVPTFVVDAPGG 

GGKIALQPNYVLSQSPDKVILRNFEGVITSYPEPENYIP 

NQADAYFES VFPETADKKEPIGLS AI FADKEVSFTPENV 

D RIKRREAYIANPEHETLKDRRERRDQLKEKKFLAQQK 

KQKETECGGDSS 



[108] The second sequence in the set indicates the diversity of the AAM 
polypeptides of the present invention relative to the known wild-type B. subtilis KAM 
sequence by designating with the letter "X" followed by the residue number those 
residues in the Applicants' AAM polypeptides that differ from those of wild-type B. 
subtilis KAM sequence: 



MX2NKWYKPKRHWX, 3 EIEXi7WX ]9 DVPX 2 3 X 2 4 KWNDWLW 
X32 L T X35 T V X38 TLDDX43K.KVINLTEDEEEGVRISTKTIPL 
X 67 ITPX 7I X72X 7 3 X 74 LMDPX 79 X 8 oPRCPVRMQSVPLX93EEX 96 H 
X98X99KYDLEDPLX108 XiogDEDSXmVPGXngTHRYPX^RVLF 
LVTXmQXmXnsXiae X m C R X M0 Xmi T R R X, 45 F S G Q I G M G V P 
Xi 56 KQLDAAIAYIRETPEIRDCLISGGDGLLINXi 87 QILEYI 
L K E X197 R S X 2 oo P H X203 X 2 o4 VIRIGTRAPVVFPQRITDH X224 C E I 
LKX 2 3oX 2 3iHPVX235 LX237THX 24 oNTSIEMTEEX25oVEAX254EKL 
VNAGVPVGNQ AVVLAGINX 27 6 SVPX28oX 2 8iKKLMHDLVKI 
RVRPYYIYQCDLSEGX307 X 3 o8HX3ioX 3 „ APVSKGLX 3I9 HEGL 
RGHTX 329 GX 3 3i AVPTFVVX339APGGGGKIALX350PNYVLSQ 
S P X360K VIL RN F E G VITS YPE PENX380X381 PNQADAYFESV 
X393 PX395TADKKEPIGLSA X408 FAX4iiKEVSX4i6TPENV X422 R I 
KRREAYIANPEHETLX440DRREX44SRX447QLKEKKX454X455A 
QQKKQKETECGGDSS 



The diversity of changes at various residue positions for the AAM polypeptides of the 
present invention are shown to the right of the arrow in Table 2 below and relative 
amino acid residues of wild-type KAM of B. subtilis 
(GI_2529467_GB_AAB81159.1J (SEQ ID NO: 60) which are shown to the left of 
the arrow: 
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x 2 


K-*E 


X l3 : 


K-^E 


X 17 : 


L-»R 


X19: 


K-^R 


Xb: 


E— » D, G 


X24: 


E-» G 


X 32 : 


O-R, 


X35: 


H^R 


X3g: 


R-»G 


X43: 


L-> S 


X6 7 : 


N-*S 


X„: 


Y-> C 


X72: 


Y-*H,W 


X 73 : 


A-»V 


X74: 


S-»P 


X79: 


D-> E 


Xgo: 


N->K 


Xg3: 


S-*P 


X96: 


M-+I 


Xgg: 




X99: 


T^S 


X108 


H->R 


X109 


E-+G 


X U 4 


R-*P 


Xiig 


L->P 


X124 


D->N 


X[32 


N->D, S 


X134 


C->G 


X135 


S-*P 


Xi 3 6 


M^V 


X137 


Y-»H 


X140 


Y->H 


Xl41 


: C^R 


X145 


R-^L 


X 15 6 


K-+E 


Xl87 


D->G 


X197 


L-*P 


X200 


I->T 


X 2 03 


L-+P 


X204 


E-*G 


X224 


L-^P 


X230 


K->R 


x 23 , 


Y—H 


X 2 3S 


W->R 


X237 


N->D 
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^240- 


F— > L 




s— ► p 


x 5 °- 


C— ► Y,R 


x" • 


D— > G 


x 276 . 


I— > T 


x 8 °- 


M->I, V 




I-> L 


A 308- 


G— > R 




F-> S 


x 3 ir 


R_> C 


X 3 i9: 


E->G 


X329: 


S^P 


X331: 


Y-»N 


X339: 


D^H 


X350: 


O-R 


X360: 


D->G 


X361: 


K^R 


x 38 o: 


Y-+C 


X381: 


I->T 


X393: F-» S 


X395: 


E->G 


Xtos: 


I-»L 


Xh,: 


D->G 


X416: 


F-> S 


X422: 


D->V 


X440: 


K-+E 


X445: 


R-*K 


X447: 


D^G 


X454: 


F->S 


X455: 


L->S 



[109] In a fourth aspect, the present invention is directed to a method of making an 
AAM a nucleic polypeptide of the present invention comprising (a) cultivating a host 
cell transformed with a nucleic acid sequence encoding an AAM polypeptide of the 
present invention under conditions suitable for production of the polypeptide; and (b) 
providing glucose to the cultivated host cells under conditions suitable for the 
production of P-alanine. The P-alanine may be optionally recovered from the cells. 

Example 1: Transformation protocol for aam libraries/ ApanD strain 

[110] A mutant E. coli strain - ApanD, derived from BW251 13 which is described in 

Datsenko, KA. and Wanner, B.L., Proc. Natl. Acad. Sci. USA 97:6640-6645 (2000) 
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was used as the host strain for screening of the aam gene libraries. The protocol used 
to make the deletion is detailed in Example 4 of Cargill patent application WO 
03/062173. 

[Ill] Chemical competent E. coli ApanD was removed from -80°C frozen storage 
and thawed. Thereafter, it was kept on ice until used. An aliquot (lOOul per 
transformation) was transferred into a sterile 1.5ml centrifuge tube. A KCM (5X) salt 
solution was added until the concentration in the aliquot was IX. KCM consists of 
700 mM KC1; 10 mM morpholinopiopanesulphonic acid (MOPS) adjusted to pH 5.8. 
l-5ul of the ligation mixture was added to the cells. The cells containing the ligation 
mixture were first incubated on ice for 30 minutes. The cells were heat shocked at 
42°C for 1 min, and subsequently incubated on ice for 2 minutes. 500ul of SOC 
(Maniatis, T., Fritsch, E. F., and Sambrook, J. (1982) Molecular Cloning: A 
Laboratory Manual, 1st Ed., pp. A.2 and A3, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY) was added to the cells, and the cells were incubated at 37°C for 1 
hour with agitation. The cells were then centrifuged at 5000 rpm for 3 minutes, and 
the SOC was removed. The cell pellet was re-suspended in 500|j.l of M9 selection 
medium ((Maniatis, T., Fritsch, E. F., and Sambrook, J. (1982) Molecular Cloning: 
A Laboratory Manual, 1st Ed., pp. A.2 and A3, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY) and incubated at 30°C for 2-4 hours with agitation. The cells 
were then plated onto M9 minimal agar medium supplemented with 1% mannose, 
20uM iron citrate, 5.0 g/1 a-alanine, O.lmM isopropyl-p-D-thiogalactoside (TPTG) 
(Sigma Chemical Corp., St. Louis, MO), 50mM MOPS, 25mM bicarbonate, and 
30ug/ml chloramphenicol. The plated cells were incubated at 30°C for 3 days or until 
colonies were of sufficient size to be picked using the Q-BOT™ robot colony picker ( 
Genetix USA, Inc, Boston MA). 

[112] In Round 2 of the transformation, the above procedure was followed except 
that the incubation temperature of the last two incubations in the procedure was 
increased to 37°C, and M9 minimal selection medium was not supplemented with oc- 
alanine (0 g/L a-alanine). 
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A. Alternate Transformation protocol for aam libraries/ ApanD KLfldA 
strain 

[113] A mutant E. coli strain LpanJD, derived from BW251 13 which is described in 
Darsenko, K.A. and Wanner, B.L., Pxoc. Natl. Acad. Sci. USA 97:6640-6645 (2000) 
is used as the host strain for screening of the aam gene libraries. The protocol used to 
make the deletion is detailed in Example 4 of International patent publication WO 
03/062173. Optimally, a strain additionally having an increased expression of the 
flavodoxin (fldA) gene was used as the host strain for screening of the aam gene 
libraries, since increased flavodoxin enhances aminomutase activity when produced in 

E. coli. See USSN , by Cargill, Inc. (Liao, et al), filed October 14, 

2005, entitled "Increasing the Activity of Radical S-Adenosyl Methionine (SAM) 
Enzymes" describes the production of p-alanine from cells that express AAM and 
overexpress flavodoxin at Examples 1-4, and these examples are incorporated herein 
by reference. This same application, USSN , by Cargill, Inc. (Liao, et 

al.) filed October 14, 2005, describes in Example 4 (incorporated herein) the 
construction of a strain of E. coli in. which an artificial Pi ac /ara hybrid promoter was 
placed immediately upstream of the fldA gene. Strains carrying the artificial promoter 
before the fldA gene are designated YLifldA, where KI refers to "knock-in"). 

[114] Competent cells of E. coli ApanD KlfldA are prepared either chemically or 
electrochemically using standard protocols. Competent E. coli ApanD KlfldA was 
removed from -80°C frozen storage and thawed. Thereafter, it was kept on ice until 
used. An aliquot (lOOul per trans formation) was transferred into a sterile 1.5ml 
centrifuge tube. A KCM (5X) salt solution was added until the concentration in the 
aliquot was IX. KCM consists of 700 mM KC1; . 10 mM 
morpholinopropanesulphonic acid (MOPS) adjusted to pH 5.8. l-5|ixl of the ligation 
mixture was added to the cells. The cells containing the ligation mixture were first 
incubated on ice for 30 minutes. The cells were heat shocked at 42°C for 1 min, and 
subsequently incubated on ice for 2 minutes. 500ul of SOC (Maniatis, T., Fritsch, E. 

F. , and Sambrook, J. (1982) Molecular Cloning: A Laboratory Manual, 1st Ed., pp. 
A.2 and A.3, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY) was added to 
the cells, and the cells were incubated at 37°C for 1 hour with agitation. The cells 
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were then centrifuged at 5000 rpm for 3 minutes, and the SOC was removed. Pellets 
were subsequently resuspended in a medium appropriate for either the 
complementation assay (Example 3) or the biotransformation assay (Example 4). 

Example 2: Cloning of aam genes into pCK110900 series vectors 
[115] The strategy employed for cloning the alanine aminomutase genes into an 
inducible expression system involved the isolation of the aam gene by PCR and 
cloning of the PCR fragment into the SJ?/ restriction sites downstream from a mutant 
lac promoter/operator system. Initially, PCR primers were designed to contain a 
nucleotide sequence that is specific to the 5' and 3' ends of the aam gene, as well as 
the Shine-Delgarno sequence of the ribosome-binding site, and the unique Sfil 
restriction sites. The gene was then amplified from a template, purified and digested 
with the restriction endonuclease Sfil. The restricted PCR fragment was purified 
using the QIAquick PCR purification kit (Qiagen), and cloned into the Sfil sites of the 
expression vector pCKl 10900-1 Bla of FIG. 3 under the control of a lac promoter and 
lad repressor gene. The expression vector also contained the PI 5a origin of 
replication and the chloramphenicol resistance gene. Shuffled aam gene libraries 
were cloned by the same method. Several clones were found that expressed an active 
alanine 2,3-aminomutase (as per the method of Example 8) and the synthetic genes 
were sequenced. A polynucleotide sequence designated BSAAM (SEQ ID NO: 58) - 
was used as a starting material for all further mutations and shuffling. BSAAM (SEQ 
ID NO: 58) has approximately 99.2% nucleotide identity with the wild-type Bacillus 
subtilis lysine aminomutase (GenBank Accession No. H10329). 

Example 3 : Screening via the Tier 2a growth assay 
Tier 2a growth Assay 

[116] The growth assay identifies variants capable of generating the essential 
metabolite AcetylCoA via p-alanine produced by AAM variants in the E. coli ApanD 
host strain. Growth is therefore a function of CoA production, and indirectly of AAM 
activity. 
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A. Procedure 

[117] AAM active clones from the tier 1 complementation assay were picked with a 
QBOT™ robot colony picker (Genetix USA, Inc., Boston MA) and inoculated into a 
96-well master plate. The inoculums were grown in the 96 well master plate on a 
buffered minimal selection media (Na 2 HPO. 7H*0 12.8g/L; KH 2 P0 4 3g/L; NaCl 
0.5g/L; NH4CI lg/L; MgS0 4 2mM; CaCl 2 0.04mM; mannose 2%; IPTG ImM; ferric 
citrate 20 uM; chloramphenicol 30 ug/ml; MOPS pH 7, 50mM; and sodium 
bicarbonate pH 9, 25mM) (hereinafter "MSM") to Avhich was added O.luM [3-alanine 
and 0.5g/L a-alanine. Plates were covered using AirPore™ microporous tape 
(Qiagen, Inc.) and incubated at 25°C, 250 rpm, 85*% humidity until cultures reached 
saturation, at which time glycerol was added to the cultures to a final concentration of 
20-30%, and the plates stored at-80°C. 

[118] Samples from a frozen master plate were arrayed into an "inoculum" plate 
containing buffered minimal selection media (MSM), as described above, further 
containing 0.5g/L a-alanine. The inoculum plates were covered with AirPore™ 
microporous tape (Qiagen, Inc.) and incubated at 25°C, 250 rpm, 85% humidity until 
cultures reached saturation. 

[119] 15ul from the inoculum plate was inoculated into a 96-well "assay" plate 
containing 185ul of fresh MSM with 0.5g/L a-alanine. The assay plates were 
covered with AirPore™ microporous tape (Qiagen., Inc.) and a lid, and incubated at 
25°C, 85% humidity, 250rpm. Measurements of OD at 600nm were made at discrete 
times for a period of approximately (~) 40hours. 

B. Data Analysis 

[120] Since library variants exhibit unique growth profiles, it was preferable to 
calculate and compare growth rates (slopes) at three (3) different growth phases 
(early, mid and late) to identify all potentially improved variants. Clones that exhibit 
three (3) standard deviations above the plate average in any of the three (3) phases 
were designated as potentially improved variants and were retested in tier 2b for 
comparative ranking. 
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Example 4: Screening via the Tier 2b growth assay 

[121] The stringency of the growth screen is increased in Tier 2b by excluding oc- 
alanine (the substrate for AAM) from the medium. Under these conditions, the cell 
relies on internal/cellular pools of a-alanine to serve as a substrate for AAM, and 
subsequently, for cell growth. AAM variants capable of utilizing low, intracellular 
pools of a-alanine might represent low K M variants. 

A. Procedure 

[122] Samples from a frozen master plate were arrayed into an "inoculum" plate 
containing buffered minimal selection media (MSM), as described above, further 
containing 0.5g/L a-alanine. The inoculum plates were covered using AirPore TIM 
microporous tape and incubated at 25°C, 250 rpm, 85% humidity until cultures 
reached growth saturation. 

[123] A TECAN™ Robotic Sample Processor (Columbus, Ohio) was used to 
remove 10(4.1 of inoculum from each Tier 2a variant from the inoculum plates and seed 
it in replicates of 8 into each of the following: 

96-well Assay plate containing 190ulof fresh MSM, 0.5g/L a-alanine. 

96-well Assay plate containing 190ul of fresh MSM, containing no a-alanine. 

The Assay plates were covered with AirPore™ microporous tape and a lid and grown 

at 25°C, 85% humidity, 250rpm. Samples were collected at time points for 

approximately 3-4 days and the OD 60 onm was measured for each sample. 

B. Tier 2b Data Analysis 

[124] Variants were ranked by the following 3 criteria: 

i) Growth ratio equal to a final culture OD600 on medium without a-alanme/final 
culture OD 6 oonm on medium containing a-alanine; 

ii) Final culture ODsoo! and 

iii) Initial growth rates (in phase 1 , from approximately 0-20 hour) 



Clones with final culture ODgoonm > 0.7 were retained. 
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Clones were then ranked based on the growth ratio of criteria (i). Any clones with a 
OD 60 onm > 0- 7 were retained. However, clones that did not meet the above two 
criteria, but had a very good initial growth rate (iii) were also selected for further 
evaluation. 

Example 5: Screening via Tier 2c- PCR analysis 

The PCR screen identifies variants that contain the correct size gene in the expression 
vector prior to further screening for function. It excludes unstable gene variants that 
may have undergone deletions/truncations during the screening process. 

A. Procedure 

Potentially improved variants from frozen master plates were inoculated into a 96- 
microwell plate containing LB media with 1% glucose and 3 0(J.g/mL 
chloramphenicol. Cultures were grown at 25°C, 250 rpm, 85% humidity in plates 
covered with AirPore™ microporous tape (Qiagen, Inc.) until cultures reached 
saturation, approximately 2 days. IOuL of the culture was transferred to a PCR plate 
and boiled at 99°C for 10 minutes to disrupt the cells. Thereafter, 90 uT_ of the 
following PCR Master Mix was added to the disrupted cells: 

PCR Master Mix: 



10 uL 
4\xL 
2pL 



10X Taq Polymerase Buffer (QIAGEN, Valencia OA) 
25mMMgCl 2 



10 mM dNTPs 



1.25 pL 
1 uL 
70.5 jjL 



1.25 uL 



20 uM primer - Bf orwa rd (specific for BsAAM gene) 
20 uM primer - B,^^ (specific for BsAAM gene) 
5U/uL Taq polymerase (QIAGEN) 
Sterile water 



90 uL 



Total volume 



The Bacillus specific primers used in the PCR reaction are as follows: 
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B-forward: 

5'ccagcctggccataaggagatatacatatgaaaaacaaatggtataaac 3' SEQ ID NO: 63 
B-reverse: 

5* atggtgatggtgatggtggccagtttggccttatgaagaatcccctccgc 3* SEQ ID NO: 64 

The amplification reaction was run for 30 cycles. The first cycle was run at 94° C for 
1 minute. Thereafter, the extension procedure was performed for 29 cycles: 94.0°C 
for 1 minute; 55.0°C for 30 seconds; and 72.0°C for 1 minute. The final extension 
was performed at 72.0°C for 5 minutes. The products of the PCR reactions were 
analyzed by gel-electrophoresis on a 0.8% agarose gel. 

Example 6: Growth of AAM variants for p-alanine production (SO ml scale). 
Cell selection method for identifying AAM activity. 

[125] To identify genes encoding polypeptides that can perform the alanine 2,3- 
aminomutase reaction, an efficient screen or selection for the desired activity is 
needed. Therefore, a selection method was developed by recognizing that E. coli uses 
beta-alanine for the synthesis of pantothenic acid, which in turn is a component of 
coenzyme A (CoA) and of acyl carrier protein (ACP). CoA and ACP are the 
predominant acyl group carriers in living organisms, and are essential for growth.. 

[126] fn E. coli, the primary route to beta-alanine is from aspartate in a reaction 
catalyzed by aspartate decarboxylase (E.C. 4. 1. 1.1 1), which is encoded by the panD 
gene. A functional deletion mutation of panD (shown as ApanD) results in beta- 
alanine auxotrophy and growth inhibition, which can be alleviated by the exogenous 
addition of pantothenate or beta-alanine, or by the production of beta-alanine from 
another source. 

[127] Strain description: E. coli ApanD host (derived from BW25113, described in 
Datsenko, K.A. and Wanner, B.L., Proc. Natl. Acad. Sci. USA 97:6640-6645 (2O00)), 
transformed with pCKl 10900-1 Bla vector (low promoter strength resulting from 
mutated lac promoter sequence). The inoculum culture was grown in buffered 
minimal selection medium (MSM): M9 salts, pH 7.0-7.4, 50mM MOPs, pH 7 .0, 25 
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mM sodium bicarbonate, pH 9.0, ImM isopropyl-|3-D-thiogalactoside (EPTG), 
30ug/ml chloramphenicol, 0.1 g/L alanine, 5uM pyridoxine HC1, and 20uM fenic 
citrate. A 1 :20 dilution of inoculum was used to inoculate 50ml of MSM medium 
described above. Cultures were incubated at 25°C, 250 rpm for approximately 3 days 
or until the culture reaches OD 60 onm ~1 • Then, a-alanine was added to the medium to 
a final concentration of 300 mM, and pantothenate was added to about 300uM. 
Incubation of the supplemented medium continued at 25°C, 250 rpm. Samples were 
removed from the medium for analysis at time points from t= 0 through t=5 hours 
following the addition of a-alanine. 

Example 7: Method for extracting cells for p-alanine detection 
[128] Cells from the cultures of Example 6 were harvested by centrifugation of the 
cultures. The supernatant (spent media) was decanted and saved for further analysis 
(below). The cell pellets were washed with water. Pellets may be stored at -80°C for 
future extraction. The 50ml cell pellets (OD ~ 4.0) were re-suspended completely in 
a test tube in 0.9 ml water. The extraction volume for each sample was adjusted to 
this proportion according to the harvest ODgoo. An equal volume of methanol (-2Q°C) 
and 200 uL of micro-glass beads was added and the mixture vortexed vigorously. 
Tubes containing the mixtures were placed on dry ice/EtOH, or in a -80°C freezer, for 
about 30 min. The frozen contents in the tube were thawed at room temperature and 
vortexed vigorously again, and centrifuged at maximum speed for about 10 minutes. 
The supernatants were filtered using 0.2-0.45 micron filter plates, or syringe filters. 
[129] The spent medium was filtered using a 0.2-0.45 micron filter plate or syringe 
filter. The filtered spent medium was diluted 1:10 in -20°C methanol/water (final 
methanol concentration 50%). 

[130] The p-alanine content of cell extract and spent media was analyzed by 
LC/MS/MS (Example 8). 

For spent medium sample, the first minute was diverted to waste. The p-alanine peak 
arrived at approximately 2.0 minutes. 
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The assay can be scaled to 2ml, if only the spent media is analyzed. 
Example 8: Assay for p-alanine (LC/MS/MS) 

[131] P-alanine was determined using a combination of liquid chromatography and 
mass spectrometry. Suitable analytes were the cell extracts and spent media as 
prepared in Example 7. 

[132] The liquid chromatography (LC) phase was performed using an ASTEC 
CHIROBIOTIC™ T 4.6 cm x 50 mm chiral LC column (Advanced Separation 
Technologies, Inc., WTrippany, NJ. USA). The mobile phase consisted of two 
solutions: A: 0.25% aqueous acetic acid; and B: 0.25% (v/v) acetic acid in methanol. 
The elution was isocratic @ 0.6ml/minute. 

[133] The mass spectrometer (MS) analysis was performed on a Micromass Ultima 
Triple Quad mass spectrometer, using the following tune parameters: 
Capillary: 3.5 kV; cone: 20 V; hex 1: 15 V; aperture: 1.0V; source temp: 100°C; 
desolvation temp: 350°C; cone gas: 40 L/hr; desolvation gas: 500 L/h; low mass 
resolution(Ql): 12; high mass resolution (Ql): 12; ion energy (Ql): 0.1; collision cell 
entrance: -5; collision energy: 14; exit: 1; low mass resolution (Q2): 15 high mass 
resolution (Q2): 15; ion energy (Q2): 3.0; multiplier: 650 V. 

MS Method 
Alanine transitions 



Analyte 


Parent Ion (m/z) 


Daughter Ion (m/z) 


Dwell Time (sec) 


a-alanine 


90 


44.7 


0.1 


P-alanine 


90 


30.7 


0.1 


cc-lysine 


147 


84.5 


0.1 


P-lysine 


147 


70.5 


0.1 



The inter-channel delay was 0.1 seconds. 
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CLAIMS 

WHAT IS CLAIMED IS: 

1 . A polypeptide having alanine 2 , 3 -aminomutase activity (hereinafter an 
"AAM polypeptide") and 

(a) having an amino acid sequence selected from the group consisting of SEQ ID 
NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 
48 and 51: 

(b) having an amino acid sequence which has at least 98% homology with the amino 
acid sequence selected from the group consisting of SEQ ID NTO: 2, 22, 28, 32, and 
36; 

(c) having an amino acid sequence which has at least 99% homology with the amino 
acid sequence selected from the group consisting of SEQ ID NO: 4, 6, 8, 12, 16, 24, 
26, 30, 34 and 40; 

(d) being a polypeptide encoded by a nucleic acid sequence wlrich hybridizes under 
high stringency conditions with either (i) the nucleotide sequence of SEQ ID NO: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 41, 43, 45, 47 or 49; 
(ii) a subsequence of (i) of at least 100 nucleotides, or (iii) a complementary strand of 
(i)or(ii); or 

(e) being a variant of the polypeptide of (d) comprising a substitution, deletion, and/or 
insertion of one to six amino acids therefrom and having AAM activity from about 1 
to about 30 |jM P-alanine produced /hour 1 cell OD at pH 7.0-7 .6, 25°C. 

2. The polypeptide of claim 1 having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48 and 51. 

3. The polypeptide of claim 1 having an amino acid sequence which has 
at least 98% homology with the amino acid sequence selected from the group 
consisting of SEQ ID NO: 2, 22, 28, 32, and 36. 
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4. The polypeptide of claim 1 having an amino acid sequence which has 
at least 99% homology with the amino acid sequence selected from the group 
consisting of SEQ ED NO: 4, 6, 8, 12, 16, 24, 26, 30, 34 and 40. 

5. The polypeptide of claim 1 being a polypeptide encoded by a nucleic 
acid sequence which hybridizes under high stringency conditions with either (i) the 
nucleotide sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21 , 23, 25, 27, 29, 
31, 33, 35, 37, 41, 43, 45, 47 or 49; (ii) a subsequence of (i) of at least 100 
nucleotides, or (iii) a complementary strand of (i) or (ii) 

6. The polypeptide of claim 1 being a variant of the polypeptide of (d) 
comprising a substitution, deletion, and/or insertion of one to six amino acids 
therefrom and having AAM activity from about 1 to about 30 uM P-alanine produced 
/hour 1/cell OD atpH 7.0-7.6, 25°C. 

7. An AAM polypeptide having an amino acid sequence of SEQ ID NO: 
2, 6, 12, 16, 20, 24, 28, 30, 32, 34, 38, 44, 46 or 48. 

8. The AAM polypeptide of claim 7 having an amino acid sequence of 
SEQ ID NO: 6, 12, 28, 34, 46 or 48. 

9. The AAM polypeptide of claim 8 having an amino acid sequence of 
SEQ ID NO: 28 or 34. 

10. A polynucleotide encoding an AAM polypeptide of claim 1 . 

11. A polynucleotide encoding a polypeptide having AAM activity, said 
polynucleotide having SEQ ED NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21 , 23, 25, 27, 29, 
31,33,35, 37, 41, 43, 45, 47 or 49. 

12. An isolated and purified polynucleotide which encodes a polypeptide 
of claim 1. 

13. An expression vector comprising a polynucleotide of claim 10 or 11 
operatively linked to a promoter. 
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1 4. A host cell transformed to express a polynucleotide of claim 1 0. 

15. A method of making an AAM polypeptide of claim 1, comprising (a) 
cultivating a host cell comprising a nucleic acid construct comprising a nucleic acid 
sequence encoding the AAM polypeptide under conditions suitable for production of 
the polypeptide; and (b) recovering the AAM polypeptide. 

16. An AAM polypeptide of claim 1 in lyophilized form. 

17. A composition comprising a polypeptide of claim 1 in a buffered 
medium. 

18. An AAM polypeptide having from 5 to 1 1 amino acid residue changes 
relative to SEQ ID NO: 59 or a fragment thereof, the residue changes including from 
1 to 3 residue changes selected from the group consisting of G308R, G308K, F4-16S, 
F416M, D447G, D447L, D447A, D447I andD447V. 
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SEQIDNO: 

1 

50 

P_6I2529467_G8_AAB81159.1 60 (1) _ _ 

MKNKWYKPKRHWKE IELWKD VPEEK^NpT^LlBQLTHTVRTDDDL KgviNJ/T 
P_GI2634361_EMB_CAB13860.i_ 61 (1) 

MKNKWYKPKRHWKEIELWKDlPEEK^^LMQLTHTWTLDDLKllVINljT 

P_S00701550 SS'^l) 
MKNKJreKPKRHWKEIELWKD[$PEEKj^^ 

P_S00701551 53 (1) 

MS LKD KFSTH^S Q ED]^IS^^K^C^E^'l"§Tjy^iiL IC§Y I P|T 

P_S007015S2 55 (1) 

MAES^RKYY|PD^T|iEQiYl|?:H^Q"2LNR|lTLDQLI^YgT^|T 

P_S01032894 57 (_1) — - 

MNTWTRKKFHPKf^gjE ElNpJlf QtoR|pj^DLE|Y|yDffli 

Consensus 62 (1) 
MKNKWYKPKRHWKEIELWKDVPEEKWNDWLWQLTHTVRTLDDLKKVINLT 



FIG. 4A 



51 

100 

P_GI2529467_G8_AAB81159.1_ (51) 

EDpE^^ISTKTIPLWl^YYAS^PDNPRCgvp^SVlLSElMHKTK 
P_GI2634361_EMB_CAB13860.1_ (51) 

ED|E|ffelSTKTIPLN^i|YYASfeM|PDNPRCi\@5|SV3LSE|WHKTK 

PJS00701550 (51) 
ED^EpfRISTKTIPLNI^YASJ,l^PDNPRCgv|M|sV^SE-|MHKTK 

p2s00701551 (41)" 
P^^EpGf^RCLDTiR^ITgYYLSpQ\^NPND^K^^L5LfiHfe}AA 

P_S00701552_ (43) _ 
A&EE^^ESPKV^RjfylA^^YYLS^I^P^NPNC'l I^tCvfiTr^TQQpfJvj^AP 

P S01032894 (44) 
E^EElTfeGVVR^ETBR^fTPPYFSglQLNSDRCtlP/ * r. WIR ^ XH QBD 

Consensus (51) 
EDEEEGVRI STKTI PLNITPYYASLMDPDNPRCPVRMQSVP1SEEMHKTK 



FIG. 4B 
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SEQUENCE LISTING 

<110> Cfciatterjee, Ranjini 
Chen, Michelle 
Louie, Susan 
Mitchell, Ken 
Fox, Richard 

<120> Improved Alanine 2 , 3-Aminorautases and Related Polynucleotides 
<130> 0359.210WO/15686WO02 
<160> 64 

<170> Patentln version 3.3 

<210> 1 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 



<400> X 



atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagct 


atggaaggac 


60 


gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cgcacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgctt 


ctttaatgga 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aagaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


540 


gatgggc tgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


taccatccgg 


tccggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaaccc 


gttgaggcac 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatggctc 


ggttccaatt 


840 


atgaaasagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaag-ggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 
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ccaggcggag 


ggggtaaaat 


cgccctgcag ccgaactatg 


tcctgtctca aagtcccgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg attacgtcat 


atccggaacc agagaattgt 


1140 


acccccaatc 


aggcagacgc 


ctattttgag tccgttttcc 


ctgaaaccgc tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct gacaaagaag 


tfctcgtctac acccgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac atcgcaaatc 


ccjgagcatga aacattagaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa gaaaagaaat 


tfcttggcgca gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct tcataa 




1416 



<210> 2 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 2 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Tarp Lys Glu He Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 

Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Mlet Asp Pro Asp Asn 
65 70 75 80 

Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 HO 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro A^sp Arg Val Leu Phe 
115 120 125 
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Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Arg Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Pro Val Glu Ala Arg Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Gly Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 3O0 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 
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Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Cys Thr Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Glu Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 3 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 



<400> 3 

atggaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 120 

gatgatttaa agaaagtcat taatctgacc gaggatgaag aggjaaggcgt ccgtatttct 180 

accaaaacga tccccttaaa tattacacct tactatgctt ctttaatgga ccccgacaat 240 

ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aag-aaatgca caaaacaaaa 300 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggjtgcccgg tctgacacac 360 

cgctatcccg accgtgtgct gtttcttgtc acgaatcagt gttccgtgta ctgccgccac 420 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 
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-5- 



ccgcatctgg 


aagtcatccg 


catcggaaca 


cg-tgctcccg 


tcgtctttcc 


gcagcgcgtt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgga 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gfc. tgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cg-tgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


ta-tgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctggc 


1080 


agagtgatct 


taagaaattt 


tgaaggtgtg 


at tacgtcat 


acccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 4 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 4 

Met Glu Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asrp Asp Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 

Pro Leu Asn He Thr Pro Tyr Tyr Ala. Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 
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Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 HO 

Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 

Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 

Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 

He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 

He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 

Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg Val Thr Asp His Leu 
210 215 220 

Cys Glu He Leu Lys Lys Tyr His Pro Val Tirp Leu Asp Thr His Phe 
225 230 235 240 

Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 

Leu Val Asn Ala Gly Val Pro Val Gly Asn GLn Ala Val Val Leu Ala 
260 265 270 

Gly He Asn Asp Ser Val Pro He Met Lys hys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 
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Glu Gly lie Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu lie 
305 310 315 320 



lie Glu Gly Leu Arg Gly His Thr Se:r Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Gly Arg Val lie Leu Arg Asn Phe Glu 
355 360 365 



Gly Val lie Thr Ser Tyr Pro Glu Pro Glu Asn Tyr lie Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Ph.<a Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Ph.e Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 42 5 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala GLn Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 5 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 5 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 
gttccggaag ggaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 120 
gatgatttaa agaaagtcat taatctgacc cjaggatgaag aggaaggcgt ccgtatttct 180 
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accaaaacga tccccttaaa tattacacct tactatgctt ctttaatgga ccccgacaat 24 0 

ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aagaaataca caaaacaaaa 30 0 

tacgatatgg aagacccgct tcatggggat gaagactcac cggtacccgg tctgacacac 36 0 

cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttctgtgta ctgccgccac 42 0 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 48 0 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 54 0 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 60 0 

ccgcatctgg aagtcatccg catcggaaca cgtgcccccg tcgtctttcc gcagcgcatt 65 0 

accgatcatc tgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 72 0 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 78 0 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta ttaatgattc ggttccaatt 84 0 

atgaaaaagc tcatgcatga cttggtaaaa atcagagtcc gtccttatta tatttaccaa 90 0 

tgtgatctgt cagaaggaat aaggcatttc cgtgctcctg tttccaaagg tttggagatc 9S0 

attgaagggc tgagaggtca tacctcaggc tatgcggttc ctacctttgt cgttcacgca 102 0 

ccaggcggag gaggtaaaat cgccctgcag ccgaactatg tcctgtctca aagtcctgac 108 0 

aaagtgatct taagaaattt tgaaggtgtg attacgtcat atccggaacc agagaattat 114,0 

atccccaatc aggcagacgc ctattttgag tccgttttcc ctgaaaccgc tgacaaaaag 12O0 

gagccgatcg ggctgaghgc catttttgct gacaaagaag tttcgtctac acctgaaaat 12S0 

gtagacagaa tcaaacggcg tgaggcatac atcgcaaatc cggagcatga aacattaaaa 132 0 

gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 138 0 

cagaaagaga ctgaatgcgg aggggattct tcataa 143_6 

<210> 6 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 6 



Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 
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Leu Trp Lys Asp Val Pro Glu Gly Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr lie 
50 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 

Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu He 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Gly Asp Glu Asp 
100 105 HQ 



• Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 

Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 

He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg lie 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 2 50 255 
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Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly lie Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys lie Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 
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<210> 7 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 








<220> 

<223> Synthetic Construct 








atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagtt atggaaggac 


60 


gttccggaag 


agaaatgg-aa 


cgattggctt 


tgacagctga 


cacacactgt aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacca 


tactatgcga 


gcttaatgga tccagaaaac 


240 


ccacgttgtc 


cggtacgcat 


gcagtctgtg 


ccgctttccg 


aagaaatgca caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg tctgacacac 


360 


cgctatcccg 


accgtgt^ct 


gtttcttgtc 


acgaatcaat 


gttccgtgta ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


tccccaaaaa acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc gcagcgcatt 


660 


accgatcatc 


cgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa cacccatttt 


720 


aacacaagca 


tcgaaat^ac 


agaagaatcc 


gttgaggcat 


gtgaaaagct ggtgaacgcg 


780 


ggagtgccgg 


tcggaaa-fcca 


ggctgtcgta 


ttagcaggta 


ttaatgattc ggttccaatt 


840 


atgaaaaagc 


tcatgca-tga 


cttggtaaaa 


atcagagtcc 


gtccttatta tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tctccaaagg tttggagatc 


960 


attgaagggc 


tgagagg-fcca 


taccccaggc 


tatgcggttc 


ctacctttgt cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgtttccc 


ctgaaaccgc tgacaaaaag 


1200 


gagccgatcg 


ggctgag-fcgc 


catttttgct 


gacaaagaag 


tttcgtctac acctgaaaat 


12 60 


gtagacagaa 


tcaaacggcg 


tgaggcctac 


atcgcaaatc 


cggagcatga aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


tttcggcgca gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 




1416 
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<210> 8 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 8 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 

20 25 30 

Leu Thr His Thr Val Arg Thar Leu Asp Asp Leu Lys Lys Val He Asn 

35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr lie 

50 55 60 

Pro Leu Asn He Thr Pro Tyx Tyr Ala Ser Leu Met Asp Pro Glu Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 

His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 no 



Ser Pro Val Pro Gly Leu Thx His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Sex Val Tyr Cys Arg His Cys Thr Arg Arg 
130 13 5 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 

145 150 155 ~ 160 

Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 

165 170 175 



He Ser Gly Gly Asp Gly Leu. Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 
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lie Leu Lys Glu Leu Arg Ser lie Pro His Leu Glu Val 13_e Arg lie 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Pro 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 ~ 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 2-70 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Pro Gly Tyr Ala Val Paro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 3 50 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Ser Pro Glu Thr Ala. A.sp Lys Lys 
385 390 395 400 

Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 * 415 

Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 
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Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Ser Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 



<210> 9 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 9 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 120 

gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 180 

accaaaacga tccccttaaa tattacacct tactatgctt ctttaatgga ccccgacaat 240 

ccgagatgcc cggtgcgcat gcagtctgtg ccgctttctg aagaaatgca caaaacaaaa 300 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 360 

cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 420 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 

accgatcatc tgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 720 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 780 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta ttaatgattc gg-ttccaatt 840 

atgaaaaagc tcatgcatga cttggtaaaa atcagagtcc gtccttatta tgtttaccaa 900 

tgtgatctgt cagaaggaat aaggcatttc cgtgctcctg tttccaaagg tttggagatc 960 

attgaagggc tgagaggtca tacctcaggc tatgcggttc ctacctttgt cg-ttcacgca 1020 

ccaggcggag gaggtaaaat cgccctgcag ccgaactatg tcctgtctca aagtcctga£ 1080 
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aaagtgatct 


taagaaattt tgaaggtgtg attacgtcat atccggaacc agagaattat 


1140 


atccccaatc 


aggcagacgc ctattttgag tccgttttcc ctgaaaccgc tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc catttttgct gacaaagaag tttcgtctac acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg tgaggcatac atcgcaaatc cggagcatga aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg tcagctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg aggggattct tcataa 


1416 



<210> 10 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 10 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 

Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 

His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 no 

Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 
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Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 ~ 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala "Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr Val Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 
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Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 450 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 11 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 11 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 
gtcccggaag agaaatggaa cgattggctt tgacagctga cacacac-fcgt aggaacgtta 120 
gatgatttaa agaaagtcat caatctgacc gaggatgaag aggaaggcgt ccgtatttct 180 
accaaaacga tccccttaaa tattacacct tactatgctt ctttaatgga ccccgacaat 240 
ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aagaaatgca caaaacaaaa 300 
tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 360 
cgctatcccg accgtgtgct gtttcttgtc acgaatcaag gttccgfcgta ctgccgccac 420 
cgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 
gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgttbaat ttcaggcggt 540 
gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 
ccgcatccgg aagtcatccg catcggaaca cgtgctcccg tcgtcttccc gcagcgcatt 660 
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accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaact 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggcca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 12 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 12 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Gly Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 ' 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 
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Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Gly Ser Val Tyr Cys Arg His Arg Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Pro Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 22 0 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro Thr Met Lys Lys Le-u Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 " 320 
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lie Glu Gly Leu Arg Gly His Thr Ser Gl:y Tyr Ala Val Pro Thr Phe 
325 33 0 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Prro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Aarg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Aarg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 13 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 13 

atgaaaaaca aatggtataa accgaaacgg catfcgggagg agatcgagcg atggaaggac 60 
gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 120 
gatgatttaa agaaagtcat taatctgacc gagcjatgaag aggaaggcgt ccgtatttct 180 
accaaaacga tccccttaaa tattacacct tactatgctt ccttaatgga ccccgacaat. 240 
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ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aagaaatgca caaaacaaaa 3 00 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 3 60 

cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 420 

tgcacacgcc ggcgcttttc cggacaaatc gggatgggcg tccccaaaaa acagcttgat 480 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagcc gcgcagcact 600 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 

accgatcatc tgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 720 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 780 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta ttaatgattc ggttccaatt 840 

gtgaaaaagc tcatgcatga cttggtaaaa atcagagtcc gtccttatta tatttaccaa 900 

tgtgatctgt cagaaggaat aaggcattcc cgtgctcctg tttccaaagg tttggagatc 960 

attgaagggc tgagaggtca tacctcaggc tatgcggttc ctacctttgt cgttcacgca 1020 

ccaggcggag gaggtaaaat cgcccbgcag ccgaactatg tcctgtctca aagtcctgac 1080 

aaagtgatct taagaaattt tgaaggtgtg attacgtcat atccggaacc agagaattat 1140 

atccccaatc aggcagacgc ctattttgag tccgttttcc ctgaaaccgc tgacaaaaag 1200 

gagccgatcg ggctgagtgc catttttgct gacaaagaag tttcgtctac acctgaaaat 1260 

gtagacagaa tcaaacggcg tgaggcatac atcgcaaatc cggagcatga aacattaaaa 1320 

gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 1380 

cagaaagaga ctgaatgcgg aggggattct tcataa 1416 

<210> 14 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 14 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Glu Glu He Glu 



Arg Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
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Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Pro Arg Ser Thr Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 
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Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly lie Asn Asp Ser Val Pro lie Val Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr "Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Ser Arg Ala Pxo Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp 1,-ys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu P:ro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Prie Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Pile Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He L^s Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 15 
<211> 1416 



WO 2006/047589 



PCT7US2005/038552 



-24- 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 15 

atgaaaaaca aatggtataa accgaaacgg cattgggagg agatcgagcg atggaaggac 6 0 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 12 0 

gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 18 0 

accaaaacga tccccttaaa tatta.cacct tactatgcfct ccttaatgga ccccgacaat 24 0 

ccgagatgcc cggtacgcat gcagfcctgtg ccgctttctg aagaaatgca caaaacaaaa 3O0 

tacgatafcgg aagacccgct tcatcjaggat gaagattcac cggtacccgg tctgacacac 36 0 

cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 42 0 

tgcacacgcc ggcgcttttc cggacaaatc gggatgggcg tccccaaaaa acagcttgat 48 0 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaastttta gaatatattt taaaagagcc gcgcagcact 6O0 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 6S0 

accgatcatc tgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 720 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 780 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta ttaatgattc ggttccaatt 84=0 

gtgaaaaagc tcatgcatga cttgcptaaaa atcagagtcc gtccttatta tatttaccaa 9O0 

tgtgatctgt cagaaggaat aaggcattcc cgtgctcctg tttccaaagg tttggagatc 9 SO 

attgaagggc tgagaggtca tacctcaggc tatgcggttc ctacctttgt cgttcacgca 1020 

ccaggcggag gaggtaaaat cgccctgcag ccgaactatg tcctgtctca aagtcctgac 10S0 

aaagtgatct taagaaattt tgaaggtgtg attacgtcat atccggaacc agagaattat 1140 

atccccaatc aggcagacgc ctattttgag tccgttttcc ctgaaaccgc tgacaaaaag 12 00 

gagccgatcg ggctgagtgc catttttgct gacaaagaag tttcgtctac acctgaaaat 12 60 

gtagacagaa tcaaacggcg tgag-gcatac atcgcaaatc cggagcatga aacattaaaa 13 20 

gatcggcgtg agaaaagagg tcag-ctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 13 SO 

cagaaagaga ctgaatgcgg aggg-gattct tcataa 143.6 

<210> 16 
<211> 471 
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<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Synthetic Construct 



<400> 16 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Glu Glu lie Glu 



Arg Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 17 5 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Pro Arg Ser Thr Pro His Leu Glu Val He Arg He 
195 200 205 
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Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg lie Thr Asp His Leu 
210 215 220 



Cys Glu lie Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Val Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Ser Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 
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Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly G3_n 
435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thrr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 17 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 



<400> 17 



atgaaaaaca 


aatggtataa 


accgaaacgg cattggaagg agatcgagtt 


atggaag-gac 


60 


gttccggaag 


agaaatggaa 


cgattggctt tgacggctga cacacactgt 


aagaaccjtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc gaggatgaag aggaaggcgt 


ccgtatfctct 


180 


accaaaacga 


tccccttaaa 


tattacacct tactatgctc ctttaatgga 


coocgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg ccgctttccg aagaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat gaagatacac cggtacccgg 


bccgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc acgaatcaat gctccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc ggaatgggcg tccccaaaaa 


acagcttgat 


480 


gctgcaattcj 


cttatatccg 


ggaaacaccc gaaatccgcg attgtttaat 


ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta gaatatattt taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca cgtgctcccg tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa tatcatccgg tctggctgaa 


cacccafcttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc gttgaggcat gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta ttagcaggta ttaatgattc 


ggttcca.att 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa atcagagtcc gtccttatta 


tatttaccaa 


900 


tgtgatctgfc 


cagaaggaat 


aaggcatttc cgtgctcctg tttccaaagg 


tttggagj-atc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc tatgcggttc ctacctttgt 


cgttcacgca 


1020 


ccaggcggacj 


gaggtaaaat 


cgccctgcag ccgaactatg tcctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg attacgtcat atccggaacc 


agagaattat 


1140 
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atccccaatc aggcagacgc ctattttgag tccgttttcc ctgaaaccgc tgacaaaaag 1200 

gagccgatcg ggctgagtgc catttttgct gacaaagaag tttcgtctac acctgaaaat 1260 

gtagacagaa tcaaacggcg tgaggcatac afccgcaaatc cggagcatga aacattaaaa 1320 

gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 1380 

cagaaagaga ctgaatgcgg aggggattct tcataa 141 S 

<210> 18 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 18 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 

Leu Trp Lys Asp Val Paro Glu Glu Lys Trp Asn Asp Trp Leu Trp Airg 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 

Leu Thr Glu Asp Glu GILu Glu Gly Val Arg lie Ser Thr Lys Thr He 
50 55 60 

Pro Leu Asn lie Thr Pro Tyr Tyr Ala Pro Leu Met Asp Pro Asp Asn 
55 7 0 75 80 

Pro Arg Cys Pro Val A-ig Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Thr Pro Val Pro Gly Pro Thr His Arg Tyr Pro Asp Arg Val Leu Ptie 
115 120 125 



Leu Val Thr Asn Gin Qys Ser Val Tyr Cys Arg His Cys Thr Arg Aarg 
130 135 140 
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Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu lie Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
' 180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Ajrg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 2 55 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Avsp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp L,eu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro A.sn Gin 
370 375 380 
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-30- 

Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 4O0 

Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 

Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 



<210> 19 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 19 

atggaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaacjgac 60 
gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 12 0 
gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 180 
accaaaacga tccccttaaa tattacacct tactatgctt ctttaatgga ccccgacaat 240 
ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aagaaatgca caaaacaaaa 300 
tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 360 
cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 420 
tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 
gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 
gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 
ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 
accgatcatc tgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 720 
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aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc gtccttatta 


tatttacczaa 


900 


tgtgatctgt 


ctgagggctt 


ggggcatttc 


cgtgctcctg tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc ctacctttgt 


cgttcaccjca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg tcctgtcaca 


aagtcctQ-ac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat atccggaacc 


agagaatfcat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag tttcgtttac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagaga 


tcagctcaaa 


gaaaagaaat ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 




1416 



<210> 20 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 20 

Met Glu Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 

Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 
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His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly Leu Gly His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 
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He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 . 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Phe 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Asp Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 21 
<211> 1416 ' 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 21 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 
gtcccggaag agaaatggaa cgattggctt tgacagctga cacacactcjt aagaacgtta 120 
gatgatttaa agaaagtcat taatctgacc gaggatgagg aggaaggcg-t ccgtatttct 180 
accaaaacga tccccttaaa tattacacct taccatgctt ctttaatgcja ccccgacaat 240 
ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aagaaatgca caaaacaaaa 300 
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tacgacatgg 


aagacccgct 


tcatgaggat 


gaagattcac cggtacccgg tccgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat gttccgtgta ctgccgccac 


420 


tgcacacgcc 


ggctcttttc 


cggacaaatc 


ggaatgggcg tccccaaaaa acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg attgtttaat ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt taaaagagct gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg tcgtctttcc gcagcgcgtt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg tcfcggctgaa cacccatctt 


720 


aacacaagca 


tcgaaatgac 


agaagaaccc 


gttgaggcat gtgaaaagct ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcgggta ttaatgattc ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc gtccttatta tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


tgtgctcctg tttccaaagg tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc ctacctttgt cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg tcctgtctca aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat atccggagcc agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc ctgaaaccgc tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag tttcgtctac acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc cggagcatga aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat ttfctggcgca gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 


1416 



<210> 22 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 22 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Txp Gin 
20 25 30 
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Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 



Pro Leu Asn He Thr Pro Tyr His Ala Ser Leu Met Asp Pro Asp Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Gl-u Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 11 0 



Ser Pro Val Pro Gly Pro Thx His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thx Arg Arg 
130 135 140 



Leu Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 19 0 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg Val Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thur His Leu 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Pro Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 27 0 
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Gly lie Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys lie Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Cys Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser £ 
465 470 



<210> 23 

<211> 1416 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic Construct 
<400> 23 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaagac 60 

gttccggacg aaaagtggaa cgattggctt tgacagctga cacacactgt aagaacgtta 120 

gatgattcaa agaaagtcat taatctgacc gaggatgaag aggaaggcg-t ccgtatttct 180 

accaaaacga tccccttaaa tattacacct tactatgctt ctttaatgga ccccgacaat 240 

ccgagatgcc cggtacgcat gcagtctgtg ccactttctg aagaaatgca caaaacaaaa 300 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 360 

cgctatcccg gccgtgtgct gtttcttgtc acgaatcaat gttccgtgca ctgccgccac 420 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccgaaaa acagcttgat 480 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 

accgatcatc tgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 720 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 780 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta ttaatgattc ggttccaatt 840 

atgaaaaagc tcatgcatga cttggtaaaa atcagagtcc gtccttatta tatttaccaa 900 

tgtgatctgt cagaaggaat aaggcatttc cgtgctcctg tttccaaagg tttggagatc 960 

attgaagggc tgagaggtca tacctcaggc tatgcggttc ctacctttgt cgttcacgca 1020 

ccaggcggag gaggtaaaat cgccctgcag ccgaactatg tcctgtctca aagtcctgac 1080 

aaagtgatct taagaaattt tgaaggtgtg attacgtcat atccggaacc agagaattat 1140 

atccccaatc aggcagacgc ctattttgag tccgttttcc ctgaaacccjc tgacaaaaag 1200 

gagccgatcg ggctgagtgc catttttgct ggcaaagaag tttcgtctac acctgaaaat 1260 

gtagacagaa tcaaacggcg tgaggcatac atcgcaaatc cggagcatgja aacattaaaa 1320 

gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 1380 

cagaaagaga ctgaatgcgg aggggattct tcataa 1416 

<210> 24 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Synthetic Construct 



<400> 24 



Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Asp Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Ser Lys Lys Val lie Asn 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu H±s Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Gly Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val His Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Glu Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 
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Gly Thr Arg- Ala Pro Val Val Phe Pro Gin Arg lie Thr A.sp His Leu 
210 215 220 



Cys Glu lie Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser lie Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 

245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly lie Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys A.sp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly L.eu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu G-ln Pro Asn 
340 345 3 50 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala A_sp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Gly Lys Glu Val Ser Ser 
405 410 " 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala T^r He Ala 
420 425 4 30 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys hjcg Gly Gin 
435 440 445 
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-40- 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Gl_u Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 25 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 








<220> 

<223> Synthetic Construct 








<400> 25 
atgaaaaaca 


aatggtataa 




cattggaagg agatcgagtt 


atgjgaaggac 


60 








tgacagctga cacacactgt 


aag/aacgttg 


120 








gaggatgaag aggaaggcgt 


ccg-tatttct 


180 








tactatgctt ctttaatgga 


ccccgacaaa 


240 


ccgagatgcc 






ccgctttctg aagaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat gttccgtgta 


ctg-ccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg tccccaaaaa 


acetgcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg attgtttaat 


ttc aggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt taaaagagct 


gcg-cagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg tcgtctttcc 


gca.gcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat gtgaaaagct 


ggfcgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta ttaatgattc 


ggfctccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc gtccttatta 


tatttaccaa 


900 


tgtgacctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg tttccaaagg 


ttfc ggggatc 


960 


attgaagggc 


tgggaggtca 


tacctcaggc 


tatgcggttc ctacctttgt 


cgt tcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcgg 


ccgaactatg tcctgtctca 


aag-tcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc ctgaaaccgc 


tg&caagaag. 


1200 
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gagccgatcg ggctgagtgc catttttgct gacaaagaag tttcgtctac acctgaaaat 12 60 

gtagacagaa tcaaacggcg tgaggcatac atcgcaaatc cggagcatga aacattaaaa 1320 

gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 1380 

cagaaagaga ctgaatgcgg aggggattct tcataa 1416 

<210> 26 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 26 

Met Lys Asn Lya Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 

Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Lys 
65 70 75 80 

Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 
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Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu lie Arg Asp Cys Leu 
165 170 175 



lie Ser Gly Gly Asp Gly Leu Leu lie Asn Asp Gin lie Leu Glu Tyr 
180 185 190 



lie Leu Lys Glu Leu Arg Ser lie Pro His Leu Glu Val lie Arg lie 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg lie Thr Asp His Leu 
210 215 220 



Cys Glu lie Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser lie Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly lie Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys lie Arg Val Arg Pro Tyr Tyr lie Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Gly He 
305 310 315 320 



He Glu Gly Leu Gly Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Arg Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 
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Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 

Glu Pro lie Gly Leu Ser Ala lie Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 

Thr Pro Glu Asn Val Asp Arg lie L,ys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 



<210> 27 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 27 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 
gttccgggag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 120 
gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 180 
accaaaacga tccccttaaa tattacacct tgctatgctc ctttaatgga ccccgacaac 240 
ccgagatgcc cggtacgcat gcagtctgtff ccgctttctg aagaaatgca caaaacaaaa 300 
tacgatatgg aagacccgct tcgtgaggat gaagattcac cggtacccgg tctgacacac 360 
cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 420 
tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 
gctgcaattg cttatatccg ggaaacaccc - gaaatccgcg attgtttaat ttcaggcggt 540 
gatgggctgc tcatcaacgg ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 
ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 
accgatcatc tgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 720 
aacacaagcg tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 780 
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ggagtgccgg 


tcggaaatca ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga cttggtaaaa 


atcagagtcc gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


ggggtaaaat cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


aaagtaatct 


taagaaattt tgaaggtgtg 


attacgtcat atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc ctattttgag 


tccgttttcc 


ctggaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg tcagctcaaa 


gaaaagaaat 


ctttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg aggggattct 


tcataa 






1416 



<210> 28 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 28 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Gly Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 

Pro Leu Asn He Thr Pro Cys Tyr Ala Pro Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 
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His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu Arg Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Gly Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser Val Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 * 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 
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Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Gly Thr Ala Asp Lys Lys 
385 ' 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Ser Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 



<210> 29 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 29 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 



120 
180 



gttccggaag agaaatggaa cgattggctt tgacggctga cacacactgt aagaacgtta 

gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 

accaaaacga tccccttaag tattacacct tactatgctt ctttaatgga ccccgacaat 240 

ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aggaaatgca caaaacaaaa 300 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 3 60 
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cgctatcccg accgtgtgct 


gtttcttgtc 


acgaatcaat gttccgtgta crtgccgccgc 


420 


tgcacacgcc ggcgcttttc 


cggacagatc 


ggaatgggcg tccccaaaaa eicagcttgat 


480 


gctgcaattg cttatatccg 


ggaaacaccc 


gaaatccgcg attgtttaat fctcaggcggt 


540 


gatgggctgc tcatcaacga 


ccaaatttta 


gaatatattt taaaagagct ^cgcagcatt 


600 


ccgcatctgg aagtcatccg 


catcggaaca 


cgtgctcccg tcgtctttcc cgcagcgcatt 


560 


accgatcatc tgtgcgagat 


attgaaaaaa 


tatcatccgg tctggctgaa eracccatttt 


720 


aacacaagca tcgaaatgac 


agaagaatcc 


gttgaggcat gtgaaaagct sgtgaacgcg 


780 


ggagtgccgg tcggaaatca 


ggctgtcgta 


ttagcaggta ttaatgattc g-gttccaatt 


840 


atgaaaaagc tcatgcatga 


cctggtaaaa 


atcagagtcc gtccttatta fcatttaccaa 


900 


tgtgatctgt cagaaggaat 


acggcatttc 


cgtgctcctg tttccaaagg fcttggagatc 


960 


attgaagggc tgagaggtca 


tacctcaggc 


tatgcggttc ctacctttgt cgttcacgca 


1020 


ccaggcggag gaggtaaaat 


cgccctgcag 


ccgaactatg tcctgtctca eaagtcctgac 


1080 


aaagtgatct taagaaattt 


tgaaggtgtg 


attacgtcat atccggaacc agagaattat 


1140 


atccccaatc aggcagacgc 


ctattttgag 


tccgttttcc ctgaaaccgc tgacaaaaag 


1200 


gagccgatcg ggctgagtgc 


catttttgct 


gacaaagaag tttcgtctac acctgaaaat 


1260 


gtagacagaa tcaaacggcg 


tgaggcatac 


atcgcaaatc cggagcatga aacattaaaa 


1320 


gatcggcgtg agaaaagagg 


tcagctcaaa 


gaaaagaaat ttttggcgca g-cagaaaaaa 


1380 


cagaaagaga ctgaatgcgg 


aggggattct 


tcataa 


1416 



<210> 30 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<2'20> 

<223> Synthetic Construct 
<400> 30 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Arg 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 
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Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 



Pro Leu Ser He Ttir Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg Arg Cys. Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu. Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
24=5 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 
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Gly lie Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Xeu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 31 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Synthetic Construct 
<400> 31 



atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg agatcgagtt 


atggaaggac 


60 


gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga cacgcactgt aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag aggaaggcgt 


ccgtattfcct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgcga gcttaatgga 


tccagaaaac 


240 


ccacgttgtc 


cggtacgcat 


gcagtctgtg 


ccgctttctg aagaaatgca cacaagcaaa 


300 


tatgacatgg 


aagatccgct 


tcatgaggat 


gaagattcac cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgagtcaat gtcccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg tccccaaaaa 


acagcttgjat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg attgtttaat 


ttcaggcg-gt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


gagtcatccg 


catcggaaca 


cgtgctcccg tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaga 


tatcatccgg tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat gtgaaaagct 


ggtgaaccjcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc ctacctttgt 


cgttcaccjca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg tcctgtctca 


aagtcctcjac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat atccggaacc 


agagaatfcat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 




1416 



<210> 32 

<211> 471 

<212> PRT 

<213> Artificial Sequence 



<220: 



WO 2006/047589 



PCT/US2005/038552 



-51- 

<223> Synthetic Construct 
<400> 32 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu. 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr Arg Thx Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 
50 55 60 

Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Glu Asn 
65 70 75 80 

Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Thr Ser Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Ased 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Ser Gin Cys Pro Val Tyr Cys Arg His Cys Thr Arg Arg- 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp> 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Gly Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu. 
210 215 220 
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Cys Glu He Leu Lys Arg Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser lie Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
2-75 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr lie Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly lie Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
3 55 360 365 



Gly Val I le Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 
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Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 33 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 33 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 120 

gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 180 

accaaaacga tccccttaaa tattacacct tactatgctt ctttaatgga ccccgacaat 240 

ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aagaaatgca caaaacaaaa 3 00 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 3 60 

cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 420 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg actgtctgtt gtctggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 

accgatcacc tgtgcgagat gttaaaaaaa tatcatccgg tctggctgaa cacccatttt 720 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 780 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta ttaatgattc ggttccaatt 840 

atgaaaaagc tcatgcatga cttggtaaaa atcagagtcc gtccttatta tatttaccaa 900 

tgtgatctgt cagaaggaat aaggcatttc cgtgctcctg tttccaaagg tttggagatc 960 

attgaagggc tgagaggtca tacctcaggc tatgcggttc ctacctttgt cgttcacgca 1020 

ccaggcggag gaggtaaaat cgccctgcag ccgaactatg tcctgtctca aagtcctgac 1080 

aaagtgatct taagaaattt tgaaggtgtg attacgtcat atccggaacc agagaattat 1140 

atccccaatc aggcagacgc ctattttgag tccgttttcc ctgaasccgc tgacaaaaag 1200 

gagccgatcg ggctgagtgc gctgtttgct gacaaagaag tttcgtctac acctgaaaat 1260 
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gtagacagaa tcaaacggcg tgaggcatac atcgcaaatc cggagcatg-a aacattaaaa 1320 
gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 1380 
cagaaagaga ctgaatgcgg aggggattct tcataa 1416 

<210> 34 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<22 0> 

<223> Synthetic Construct 
<400> 34 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 3-10 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 
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Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



Leu Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Th.r Asp His Leu 
210 215 220 



Cys Glu Met Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Va.1 Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arrg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 
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-56- 

Glu Pro He Gly Leu Ser Ala Leu Phe Ala Asj) Lys Glu Val Ser Ser 
405 410 415 

Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 



<210> 35 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 35 

atgaaaaaca aatggtataa accgaaacgg cattggaagg- agatcgagtt atggaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 120 

gatgatttaa agaaagtcat taatctgacc gaggatgaag- aggaaggcgt ccgtatttct 180 

accaaaacga tccccttaaa tatcacacct tactatgcga. gcttaatgga tccagaaaac 240 

ccacgttgtc cggtacgcat gcagtctgtg ccgcttcctg aagaaatgca caaaacaaaa 300 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 360 

cgctatcccg accgtgtgct gtttcttgtc acggatcaafc gttccgtgta ctgccgccac 420 

cgcacacgcc ggcgcttctc cggacaaatc ggaatgggcg tccccgaaaa acagcttgat 480 

gctgcaattg cttacatccg ggaaacaccc gaaatccgcg- attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccej tcgtctttcc gcagcgcatt 660 

accgatcatc tgtgcgagat attgaaaaaa catcatccgg tctggctgaa cacccatttt 720 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat: atgaaaagct ggtgaacgcg 780 

ggagtgccgg tcggaaatca ggctgttgta ttagcaggta ttaatgattc ggttccaatt, 840 
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ataaaaaagc 


tcatgcatga cttggtaaaa atcagagtcc 


gticcttatta 


tatttaccaa 


900 


tgtgacctgt 


cagaaggaat aaggcatttc cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca tacctcaggc tatgcggttc 


cfcacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat cgccctgcag ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt tgaaggtgtg attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc ctattttgag tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc catttttgct gacaaagaag tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg tgaggcatac atccrcaaatc 


cg-gagcatga 


aacattaaaa 


132 0 


gatcggcgtg 


agaaaagagg tcagctcaaa gaaaagaaat 


tt ttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg aggggattct tcataa 






1416 



<210> 36 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 36 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Senr Thr Lys Thr He 
50 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Glu Asn 
65 70 75 ~ 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Pro Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 no 
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Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 

Leu Val Thr Asp Gin Cys Ser Val Tyr Cys Arg His Arg Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Glu Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 

He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 

210 215 220 

Cvs Glu He Leu Lys Lys His His Pro Val Trp Leu Asn Thr His Phe 

225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Tyr Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 

Gly He Asn Asp Ser Val Pro He He Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 

Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 
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Val Val His Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 



<210> 37 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 



atgaaaaaca 


aatggtataa 


accgaaacgg cattggaagg agatcgagtt 


atggaaggac 


60 


gttccggaag 


agaaatggaa 


cgattggctt tgacagctga cacacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc gaggatgaag aggaaggcgt 


ccgta-fcttct 


180 


accaaaacga 


tccccttaaa 


tattacacct tactatgctt ctttaatgga 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg ccgctttctg aagaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat gaagattcac cggtacccgg 


tctgacacac 


360 


cgctatccca 


accgtgtgct 


gtttcttgtc acgaatcaat gttccgtgta 


ctgccgccac 


420 
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-60- 



tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg tccccaaaaa acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


actgtctg-tt gtctggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct gcgcagcatt 


600 


ccgcatctgg 


aagtcattcg 


tatcggttct 


cgtgcgccag 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


afcgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


agggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctaccttfcgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg tcctgtcaca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtttac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga aacattaaaa 


1320 


gatcggcgtg 


agaaaagaga 


tcagctcaaa 


gaaaagaaat 


ttttggcg-ca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 38 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 38 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
1 5 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



WO 2006/047589 



PCT/US200S/038552 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr He 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asn Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu. He Arg Asp Cys Leu 
165 170 175 



Leu Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu. Glu Val He Arg He 
195 200 205 



Gly Ser Arg Ala Pro Val Val Phe Pro Gin Arg- He Thr Asp His Leu 
210 ~ 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp> Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val. Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 " 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lyss Leu Met His Asp Leu 
275 280 285 
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Val Lys lie Arg Val Arg Pro Tyr Tyr lie Tyr Gin Cys Asp Lou Ser 
290 295 300 

Glu Gly He Gly His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 

He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Tlir Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Prro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Plie Glu 
355 360 365 

Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro A.sn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 

Glu Pro He Gly- Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Phe 
405 410 4 15 

Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr r le Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg ft^sp Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 



<210> 39 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
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-63- 

<400> 39 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 120 

gafcgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 180 

accaaaacga tccccttaaa tattacacct tactatgctt ctttaatgga ccccgacaat 240 

ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aagaaatgca caaaacaaaa 300 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 360 

cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgca ctgccgccac 420 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 

ccgcacctgg aagtcatccg catcggaaca cgtgctcccg fccgtctttcc gcagcgcatt 660 

accgatcatc tgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 720 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat g-tgaaaagct ggtgaacgcg 780 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta fctaatgattc ggttccaatt 840 

atgaaaaagc tcatgcatga cttggtaaaa atcagagtcc cjtccttatta tatttaccaa 900 

tgtgatotgt cagaaggaat aaggcatttc cgtgctcctg fcttccaaagg tttggagatc 960 

attgaagggc tgagaggtca tacctcaggc tatgcggttc otacctttgt cgttcacgca 1020 

ccaggcggag gtggtaaaat cgccctgcag ccgaactatg fccctgtctca aagtcctgac 1080 

aaagtgatct taagaaattt tgaaggtgtg attacgtcat atccggaacc agagaattat 1140 

atccccaatc aggcagacgc ctattttgag tccgttttcc ctgaaaccgc tgacaaaaag 1200 

gagccgatcg ggctgagtgc catttttgct ggcaaagaag tttcgtctac acctgaaaat 1260 

gtagtcagaa tcaaacggcg tgaggcatac atcgcaaatc cggagcatga aacattaaaa 1320 

gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 1380 

cagaaagaga ctgaatgcgg aggggattct tcataa 1416 

<210> 40 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
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<400> 40 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val His Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



WO 2006/047589 



PCT/US2005/038552 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu lie 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 „ 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Gly Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Val Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 
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Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 41 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 41 



atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagtt atggagggac 


60 


gtcccggaag 


agaaatggaa 


cgattggctt 


tgacagc tga 


cacacactgt aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatg-aag 


aggaaggcgt ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatg-ctt 


ctttaatgga 


ccccgacaat 


240 


ccgaggtgcc 


cggtacgcat 


gcagtctgtg 


ccactgtctg 


aggaaatgca caaaagcaaa 


300 


tatgacatgg 


aagatccgct 


tcatgaggat 


gaagatfccac 


cggtacccgg tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatoaat 


gttccgtgta ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgcjgcg 


tccccaaaaa acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatcogcg 


attgtttaat 


ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccgggcggag 


gaggtaaaat 


cgccctgcag 


ccgaac tatg 


tcctgtctca aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacg-tcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgtfc ttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaa.gaag 


tttcgtctac acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcgtac 


atcgcaaatc 


cggagcatga aacattaaaa. 


1320 
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gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 1380 
cagaaagaga ctgaatgcgg aggggattct tcataa 1416 

<210> 42 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 42 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 

Leu Trp Arg Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 3 0 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr L,ys Thr He 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Ser Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 HQ 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
1*5 150 155 ' 160 

Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



WO 2006/047589 



PCT/US2005/038552 



lie Ser Gly Gly Asp Gly Leu Leu lie Asm Asp Gin lie Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 
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Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 

Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 



<210> 43 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 43 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 120 

gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 180 

accaaaacga tccccttaaa tattacacca tactatgcga gcttaatgga tccagaaaac 240 

ccacgttgtc cggtacgcat gcagtctgtg ccgctttccg aagaaatgca caaaacaaaa 300 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 360 

cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 420 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 

accgatcatc cgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 720 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 780 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta ttaatgattc ggttccaatt 840 

atgaaaaagc tcatgcatga cttggtaaaa atcagagtcc gtccttatta tatttaccaa 900 
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tgtgatctgt cagaaggaat aaggcatttc cgtgctcctg tctccaaagg- tttggagatc 960 

attgaagggc tgagaggtca taccccaggc tatgcggttc ctacctttgfc cgttcacgca 1020 

ccaggcggag gaggtaaaat cgccctgcag ccgaactatg tcctgtctca. aagtcctgac 1080 

aaagtgatct taagaaattt tgaaggtgtg attacgtcat atccggaacc agagaattat 1140 

atccccaatc aggcagacgc ctattttgag tccgtttccc ctgaaaccgc: tgacaaaaag 1200 

gagccgatcg ggctgagtgc catttttgct gacaaagaag tttcgtctac acctgaaaat 1260 

gtagacagaa tcaaacggcg tgaggcctac atcgcaaatc cggagcatga aacattaaaa 1320 

gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat tttcggcgcsi gcagaaaaaa 13 80 

cagaaagaga ctgaatgcgg aggggattct tcataa 1416 

<210> 44 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 44 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 3 0 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr L.;ys Thr He 
50 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Glu Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Slu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu TVsp Glu Asp 
100 105 LIO 
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Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu. 

165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr- 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg lies 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Pro 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 . 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Sear 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 32 O 



He Glu Gly Leu Arg Gly His Thr Pro Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 
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Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Ser Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Ser Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 



<210> 45 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 



atgaaaaaca 


aatggtataa 


accgaaacgg cattggaagg agatcgagtt 


ELcggaaggac 


60 


gttccggaag 


agaaatggaa 


cgattggctt tgacagctga cgcacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc gaggatgaag aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct tactatgcga gcttaattga 


tccagaaaac 


240 


ccacgttgtc 


cggtacgcat 


gcagtctgcg ccgctgtctg aagaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat gaagattcac cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc acgaatcaat gttccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc ggaacgggcg tccccaaaaa 


acagcttgafc. 


480 
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gctgcaactg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


tccaggccjgt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


ggatatattt 


taaaagagct 


gcgcagcstt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgcccccg 


tcggctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatfctt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgjcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgfc 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


ccctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 46 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 46 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glut 
15 10 15 

Leu Arg Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie 
50 55 



Ser Thr Lys Thr Il-e 
60 
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Pro 



Leu .Asn lie Thr Pro Tyr Tyr Ala Ser Leu He Asp Pro Glu Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Ala Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Thr Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala Thr Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Pro Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Gly Tyr 
180 " 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Gly Phe Pro Gin Arg He Thr Asp His Leu 

210 215 220 

Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 

225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 " 280 285 
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Val Lys He Arg Val Arg Pro Tyx Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 3O0 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr A3.a Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He A.la Leu Gin Pro Asn 
340 345 350 



Tyr Ala Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn T-yx He Pro Asn Gin 
370 375 3 80 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu CTir Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp L.ys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys XJys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 



<210> 47 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 



<400> 47 

atggaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 
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gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga cacaca_ctgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag aggaag-gcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgcga gcttaattga 


tccagaaaac 


240 


ccacgttgtc 


cggtacgcat 


gcagtctgtg 


ccgctttccg aagaaatgca caaaacaaaa 


300 


tacgatatgg 


aagatccgct 


tcatgaggat 


gaagattcac cggtacccgg cctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


gcgaatcaat gttccg-tgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg attgtttaat 


ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt taaaag-agct 


gcgcagcatt 


600 


ccgcatccgg 


aagtcatccg 


catcggaaca 


cgtgcccccg tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg tctggctgaa cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta ttaatg-attc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


ctfcggtaaaa 


atcagagtcc gtcctfcatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgcccctg tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tgtgcggttc ctaccfcttgt 


cgttcacgca 


1020 


coaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg tcctgtctca aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat atccgg-aacc 


agagaattat 


1140 


atccccaacc 


aggcagacgc 


ctattttgag 


tccgttttcc ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag tttcgfcctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc cggagcatga aacattaaaa 


1320 


gatcggcgtg 


agaaaagggg 


tcagctcaaa 


gaaaagaaat ttttgcjcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 




1416 



<210> 48 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 



<400> 48 
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Met Glu Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu lie Asp Pro Glu Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Ala Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg .Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 ISO 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg- Asp Cys Leu 
165 170 " 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Pro Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 
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Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 2 50 255 



Leu Val Asn Ala Gly Val Pro Val Gly iVsn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr Xle Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 ' 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Cys Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys \/al He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe 3?ro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 
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Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 49 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 49 

atgaaaaaca aatggtataa accgaaacgg cattggaagcj agatcgagtt atggaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga. cacacactgt aagaacgtta 120 

gatgatttaa agaaagtcat taatctgacc gaggatgaacj aggaaggcgt ccgtatttct 180 

accaaaacga tccccttaaa tattacacct tactaggttt ctttaatgga ccccgacaat 240 

ccgagatgcc cggtacgcat gcagtctgtg ccactgtctgr aagaaatgca caaaacaaaa 300 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 360 

cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 420 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggccj tccccaaaaa acagcttgat 480 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 

accgatcatc tgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 720 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 780 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta ttaatgattc ggttccaatt 840 

atgaaaaagc tcatgcatga cttggtaaaa atcagagtcc gtccttatta tatttaccaa 900 

tgtgatctgt cagaaggaat aaggcatttc cgtgctcctg tttccaaagg tttggagatc 960 

attgaagggc tgagaggtca cacctcaggc aatgcggttc ccacctttgt cgttcacgca 1020 

ccaggcggag gaggtaaaat cgccctgcag ccgaactatg tcctgtctca aagtcctgac 1080 

aaagtgatct taagaaattt tgaaggtgtg attacgtca.t atccggaacc agagaattat 1140 

atccccaatc aggcagacgc ctattttgag tccgttttcc ctgaaaccgc tgacaaaaag 1200 

gagccgatcg ggctgagtgc catttttgct gacaaagaag tttcgtctac acctgaaaat 1260 

gtagacagaa tcaaacggcg tgaggcatac atcgcaaatc cggagcatga aacattaaaa 1320 

gatcggcgtg agaaaagagg tcagctcaaa gaaaagaast ttttggcgca gcagaaaaaa 1380 
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cagaaagaga ctgaatgcgg aggggattct tcataa 

<210> 50 
<211> 71 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 50 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 

Pro Leu Asn He Thr Pro Tyr 
65 70 

<210> 51 
<211> 399 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 51 

Val Ser Leu Met Asp Pro Asp Asn Pro Arg Cys Pro Val Arg Met Gin 
15 10 15 

Ser Val Pro Leu Ser Glu Glu Met His Lys Thr Lys Tyr Asp Met Glu 
20 25 30 

Asp Pro Leu His Glu Asp Glu Asp Ser Pro Val Pro Gly Leu Thr His 
35 40 45 

Arg Tyr Pro Asp Arg Val Leu Phe Leu Val Thr Asn Gin Cys Ser Val 
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Tyr Cys Arg His Cys Thr Arg Arg Arg Phe Ser Gly Gin lie Gly Met 



Gly Val Pro Lys Lys Gin Leu Asp Ala Ala lie Ala Tyr lie Arg Glu 



Thr Pro Glu lie Arg Asp Cys Leu He Ser Gly Gly Asp Gly Leu Leu 
100 105 110 



He Asn Asp Gin He Leu Glu Tyr He Leu Lys Glu Leu Arg Ser He 
115 120 125 



Pro His Leu Glu Val He Arg He Gly Thr Arg Ala Pro Val Val Phe 
130 135 140 



Pro Gin Arg He Thr Asp His Leu Cys Glu He Leu Lys Lys Tyr His 
145 150 155 160 



Pro Val Trp Leu Asn Thr His Phe Asn Thr Ser He Glu Met Thr Glu 
165 170 175 



Glu Ser Val Glu Ala Cys Glu Lys Leu Val Asn Ala Gly Val Pro Val 
180 185 190 



Gly Asn Gin Ala Val Val Leu Ala Gly He Asn Asp Ser Val Pro He 
195 200 205 



Met Lys Lys Leu Met His Asp Leu Val Lys He Arg Val Arg Pro Tyr 
210 215 220 



Tyr He Tyr Gin Cys Asp Leu Ser Glu Gly He Arg His Phe Arg Ala 
225 230 235 240 



Pro Val Ser Lys Gly Leu Glu He He Glu Gly- Leu Arg Gly His Thr 
245 250 255 



Ser Gly Asn Ala Val Pro Thr Phe Val Val His Ala Pro Gly Gly Gly 
260 265 270 



Gly Lys He Ala Leu Gin Pro Asn Tyr Val Leu Ser Gin Ser Pro Asp 
275 280 285 
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Lys Val lie Leu Arg Asn Phe Glu Gly Val He Thr Ser Tyr Pro Glu 
290 295 300 



Pro Glu Asn Tyr He Pro Asn Gin Ala Asp Ala Tyr Phe Glu Ser Val 
305 310 315 320 



Phe Pro Glu Thr Ala Asp Lys Lys Glu Pro He Gly Leu Ser Ala He 
325 330 335 



Phe Ala Asp Lys Glu Val Ser Ser Thr Pro Glu Asn Val Asp Arg He 
340 345 350 



Lys Arg Arg Glu Ala Tyr He Ala Asn Pro Glu His Glu Thr Leu Lys 
355 360 365 



Asp Arg Arg Glu Lys Arg Gly Gin Leu Lys Glu Lys Lys Phe Leu Ala 
370 375 380 



Gin Gin Lys Lys Gin Lys Glu Thr Glu Cys Gly Gly Asp Ser Ser 



<210> 52 

<211> 1245 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<220> 

<221> misc_feature 

<223> This parental sequence is a modification of the wild-type KAM of 
Clostridium stricklandii 

<220> 

<221> CDS 

<222> (1)..(1245) 

<400> 52 

atg agt tta aag gat aag ttt ttt aca cat gta age caa gaa gat tgg 48 
Met Ser Leu Lys Asp Lys Phe Phe Thr His Val Ser Gin Glu Asp Trp 
15 10 15 

aat gat tgg aaa tgg caa gta aga aat cgt ata aag act gtt gaa gaa 96 
Asn Asp Trp Lys Trp Gin Val Arg Asn Arg He Lys Thr Val Glu Glu 
20 25 30 

ctt aaa aaa tat att cca ctt act cca gaa gaa gaa gaa ggg gta aaa 144 
Leu Lys Lys Tyr He Pro Leu Thr Pro Glu Glu Glu Glu Gly Val Lys 
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cgc tgt ctt gat aca tta cgt atg get att act cca tac tat eta teg 192 
Arg Cys Leu Asp Thr Leu Arg Met Ala lie Thr Pro Tyr Tyr Leu Ser 
50 55 60 

eta att gat gta gaa aat cca aat gac cct gta aga aag caa get gta 240 
Leu He Asp Val Glu Asn Pro Asn Asp Pro Val Arg Lys Gin Ala Val 
65 70 75 80 

cct ctt tct tta gag ctg cat cgc gca gcg tct gat atg gaa gac cca 288 
Pro Leu Ser Leu Glu Leu His Arg Ala Ala Ser Asp Met Glu Asp Pro 
85 90 " 95 

ctt cat gaa gat gga gat tct cca gtt cca gga ctt aca cat cgc tat 336 
Leu His Glu Asp Gly Asp Ser Pro Val Pro Gly Leu Thr His Arg Tyr 
100 . 105 110 

cct gat cgc gtt ctt ctt tta atg act gat caa tgt tea gta tac tgc 384 
Pro Asp Arg Val Leu Leu Leu Met Thr Asp Gin Cys Ser Val Tyr Cys 
115 120 125 

cgc cac tgt act cgt aga cgc ttc get ggt cga aca gat tct get gtt 432 
Arg His Cys Thr Arg Arg Arg Phe Ala Gly Arg Thr Asp Ser Ala. Val 
130 135 140 

gat acg aag caa ata gat get gcg att gaa tat ate aaa aat act cca 480 
Asp Thr Lys Gin He Asp Ala Ala He Glu Tyr He Lys Asn Thr- Pro 
145 150 155 160 

caa gta aga gac gtt eta ctt tea gga gga gat get eta tta ate tea 528 
Gin Val Arg Asp Val Leu Leu Ser Gly Gly Asp Ala Leu Leu He Ser 
165 170 175 

gat gaa aag ctt gag tac aca ate aga aga ctt cgt gaa ata cca. cac 576 
Asp Glu Lys Leu Glu Tyr Thr He Arg Arg Leu Arg Glu He Pro His 
180 185 190 

gtt gag gtt att cgt att gga tea cgt gta cca gtt gta atg cca caa 624 
Val Glu Val He Arg He Gly Ser Arg Val Pro Val Val Met Pro Gin 
195 200 205 

cgt att aca cca gaa eta gtt tct atg ctt aaa aag tat cat cca gta 672 
Arg He Thr Pro Glu Leu Val Ser Met Leu Lys Lys Tyr His Pro Val 
210 215 220 

tgg tta aat aca cac ttc aac cat cct aat gaa att act gaa gag tct 720 
Trp Leu Asn Thr His Phe Asn His Pro Asn Glu He Thr Glu Glu Ser 
225 230 235 240 

aaa cgt gca tgt gag tta ctt get gat gca ggt att cct ctt gga aat 7 68 

Lys Arg Ala Cys Glu Leu Leu Ala Asp Ala Gly He Pro Leu Gl^ Asn 
245 250 255 

caa agt gtg ctt ctt gca ggt gta aat gat tgc atg cac gtt atcj aaa 816 
Gin Ser Val Leu Leu Ala Gly Val Asn Asp Cys Met His Val Met Lys 
260 265 270 

aaa eta gta aat gac tta gtt aaa ata cgc gta cgt cct tac tat att 864 
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Lys Leu Val Asn Asp Leu Val Lys lie Arg Val Arg Pro Tyr Tyr lie 
275 280 285 

tat caa tgt gac ctt tea gtt gga att gag cac ttt cgc act cca gtt 912 
Tyr Gin Cys Asp Leu Ser Val Gly lie Glu His Phe Arg Thr Pro Val 
290 295 300 

gca aag gga ata gaa ata att gaa ggc tta aga gga cat act tea gga 960 
Ala Lys Gly He Glu He He Glu Gly Leu Arg Gly His Thr Ser Gly 
305 310 315 320 

tac tgc gtt cct aca ttt gtt gtg cat gca cct ggt ggt gga gga aaa 1008 
Tyr Cys Val Pro Thr Phe Val Val His Ala Pro Gly Gly Gly Gly Lys 
325 330 335 

act cca gtt atg cca aac tat gtt att tea caa aat cac aat aaa gtt 1056 
Thr Pro Val Met Pro Asn Tyr Val He Ser Gin Asn His Asn Lys Val 
340 345 350 

att tta cgt aac ttt gaa ggt gta att aca act tac gat gag cct gat 1104 
He Leu Arg Asn Phe Glu Gly Val He Thr Thr Tyr Asp Glu Pro Asp 
355 360 365 

cat tat act ttc cac tgt gac tgt gat gta tgc act gga aaa aca aat 1152 
His Tyr Thr Phe His Cys Asp Cys Asp Val Cys Thr Gly Lys Thr Asn 
370 375 380 

gtt cat aag gtt gga gta get gga ctt eta aat gga gag aca gcg aca 1200 
Val His Lys Val Gly Val Ala Gly Leu Leu Asn Gly Glu Thr Ala Thr 
385 390 395 400 

ctt gaa cct gag ggt ttg gaa aga aaa caa aga gga cat cac taa 1245 
Leu Glu Pro Glu Gly Leu Glu Arg Lys Gin Arg Gly His His 
405 410 



<210> 53 
<211> 414 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 53 

Met Ser Leu Lys Asp Lys Phe Phe Thr His Val Ser Gin Glu Asp Trp 
15 10 15 



Asn Asp Trp Lys Trp Gin Val Arg Asn Arg He Lys Thr Val Glu Glu 
20 25 30 



Leu Lys Lys Tyr He Pro Leu Thr Pro Glu Glu 
35 40 



Glu Glu Gly Val Lys 
45 
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Arg Cys Leu Asp Thr Leu Arg Met Ala He Thr Pro Tyr Tyr Leu Ser 
50 55 60 

Leu He Asp Val Glu Asn Pro Asn Asp Pro Val Arg Lys Gin Ala Val 
65 70 75 80 

Pro Leu Ser Leu Glu Leu His Arg Ala Ala Ser Asp Met Glu Asp Pro 



Leu His Glu Asp Gly Asp Ser Pro Val Pro Gly Leu Thr His Arg Tyr 
100 105 HO 



Pro Asp Arg Val Leu Leu Leu Met Thr Asp Gin Cys Ser Val Tyr Cys 
115 120 125 



Arg His Cys Thr Arg Arg Arg Phe Ala Gly Arg Thr Asp Ser Ala Val 
130 135 140 



Asp Thr Lys Gin He Asp Ala Ala He Glu Tyr He Lys Asn Thr Pro 
145 150 155 ' 160 



Gin Val Arg Asp Val Leu Leu Ser Gly Gly Asp Ala Leu Leu He Ser 
165 170 175 



Asp Glu Lys Leu Glu Tyr Thr He Arg Arg Leu Arg Glu He Pro His 
180 185 190 



Val Glu Val He Arg He Gly Ser Arg Val Pro Val Val Met Pro Gin 
195 200 205 



Arg He Thr Pro Glu Leu Val Ser Met Leu Lys Lys Tyr His Pro Val 
210 215 220 



Trp Leu Asn Thr His Phe Asn His Pro Asn Glu He Thr Glu Glu Ser 
225 230 235 240 



Lys Arg Ala Cys Glu Leu Leu Ala Asp Ala Gly He Pro Leu Gly Asn 
245 250 255 



Gin Ser Val Leu Leu Ala Gly Val Asn Asp Cys Met His Val Met Lys 
260 265 270 



Lys Leu Val Asn Asp Leu Val Lys He Arg Val Arg Pro Tyr Tyr He 
275 280 285 
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Tyr Gin Cys Asp Leu Ser Val Gly He Glu His Phe Arg Thr Pro Val 
290 295 300 



Ala Lys Gly He Glu He He Glu Gly Leu Arg Gly His Thr Ser Gly 
305 310 315 320 



Tyr Cys Val Pro Thr Phe Val Val His Ala Pro Gly Gly Gly Gly Lys 
325 330 335 



Thr Pro Val Met Pro Asn Tyr Val He Ser Gin Asn His Asn Lys Val 
340 345 350 



He Leu Arg Asn Phe Glu Gly Val He Thr Thr Tyr Asp Glu Pro Asp 
355 360 365 



His Tyr Thr Phe His Cys Asp Cys Asp Val Cys Thr Gly Lys Thr Asn 
370 375 380 



Val His Lys Val Gly Val Ala Gly Leu Leu Asn Gly Glu Thr Ala Thr 
385 390 395 400 



Leu Glu Pro Glu Gly Leu Glu Arg Lys Gin Arg Gly His His 
405 410 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic Construct 



<220> 

<221> CDS 

<222> (1)..{1251) 



<400> 54 

atg gca gaa agt cgt aga aag tat tat ttc cct gat gtc acc gat gag 
Met Ala Glu Ser Arg Arg Lys Tyr Tyr Phe Pro Asp Val Thr Asp Glu 



caa tgg tac gac tgg cat tgg cag gtc etc aat cga att aag acg etc 
Gin Trp Tyr Asp Trp His Trp Gin Val Leu Asn Arg He Lys Thr Leu 



gac cag ctg aaa aag tac gtt aca etc acc get gaa gaa gaa gag gga 
Asp Gin Leu Lys Lys Tyr Val Thr Leu Thr Ala Glu Glu Glu Glu Gly _ 
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35 40 45 

gta aaa gaa teg ccc aaa gta etc cga atg get ate aca cct tat tat 192 

Val Lys Glu Ser Pro Lys Val Leu Arg Met Ala lie Thr Pro Ty r Tyr 
50 55 60 

ttg agt ttg ata gac ccc gag aat cct aat tgt ccg att cgt aaa caa 240 

Leu Ser Leu lie Asp Pro Glu Asn Pro Asn Cys Pro He Arg Lys Gin 

65 70 75 80 

gec att cct act caa cag gaa ctg gta cgt get cct gaa gat cag gta 2 88 

Ala He Pro Thr Gin Gin Glu Leu Val Axg Ala Pro Glu Asp Gin Val 
85 90 95 

gac cca ctt agt gaa gat gaa gat teg ccc gta ccc gga ctg act cat 336 

Asp Pro Leu Ser Glu Asp Glu Asp Ser Pro Val Pro Gly Leu Thr His 
100 105 110 

cgt tat ccg gat cgt gta ttg ttc ctt ate acg gac aaa tgt teg atg 384 

Arg Tyr Pro Asp Arg Val Leu Phe Leu He Thr Asp Lys Cys Ser Met 
115 120 125 

tac tgt cgt cat tgt act cgc cgt cgc ttc gca gga cag aaa gat get 432 

Tyr Cys Arg His Cys Thr Arg Arg Arg Phe Ala Gly Gin Lys Asp Ala 
130 135 140 

tct tct cct tct gag cgc ate gat cga tgc att gac tat ata gec aat 480 

Ser Ser Pro Ser Glu Arg He Asp Arg Cys He Asp Tyr He Ala Asn 

145 150 155 160 

aca ccg aca gtc cgc gat gtt ttg eta teg gga ggc gat gec etc ctt 52 8 

Thr Pro Thr Val Arg- Asp Val Leu Leu Ser Gly Gly Asp Ala Leu Leu 
165 170 175 

gtc age gac gaa cgc ttg gaa tac ata ttg aag cgt ctg cgc gaa gta 576 

Val Ser Asp Glu Arg; Leu Glu Tyr He Leu Lys Arg Leu Arg Glu Val 
180 185 190 

cct cat gtg gag att gtt cgt ata gga age cgt acg ccg gta gtc etc 624 

Pro His Val Glu' He Val Arg He Gly Ser Arg Thr Pro Val Val Leu 
195 200 205 

cct cag cgt ata acg- cct caa ttg gtg gat atg etc aaa aaa tat cat 672 

Pro Gin Arg He Thr- Pro Gin Leu Val Asp Met Leu Lys Lys Tyr His 
210 215 220 

ccg gtg tgg ctg aac act cac ttc aac cac ccg aat gaa gtt acc gaa 720 

Pro Val Trp Leu Asn. Thr His Phe Asn His Pro Asn Glu Val Thr Glu 

225 230 235 240 

gaa gca gtg gag get tgt gaa aga atg gec aat gee ggt att ccg ttg 768 

Glu Ala Val Glu Ala. Cys Glu Arg Met Ala Asn Ala Gly He Pro Leu 
245 250 255 

ggt aac caa acg gtt tta ttg cgt gga ate aat gat tgt aca cat gtg 816 

Gly Asn Gin Thr Val Leu Leu Arg Gly He Asn Asp Cys Thr His Val 
260 265 270 
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atg aag aga ttg gta cat ttg ctg gta aag atg cgt gtg cgt cct tac 
Met Lys Arg Leu Val His Leu Leu Val Lys Met Arg Val Arg Pro Tyr 
275 280 285 

tat ata tat gta tgc gat ctt teg ctt gga ata ggt cat ttc cgc acg 
Tyr lie Tyr Val Cys Asp Leu Ser Leu Gly He Gly His Phe Ajrg Thr 
290 295 300 

ccg gta tct aaa gga ate gaa att ate gaa aat ttg cgc gga cac acc 
Pro Val Ser Lys Gly He Glu He He Glu Asn Leu Arg Gly His Thr 
305 310 315 320 

teg ggc tat gca gtt cct acc ttt gtg gta ggt get ccg ggg g-gt ggt 
Ser Gly Tyr Ala Val Pro Thr Phe Val Val Gly Ala Pro Gly Gly Gly 
325 330 335 

ggt aag a.ta cct gta acg ccg aac tat gtt gta tct cag tec cca cga 
Gly Lys He Pro Val Thr Pro Asn Tyr Val Val Ser Gin Ser Pro Arg 
340 345 350 

cat gtg gtt ctt cgc aat tat gaa ggt gtt ate aca acc tat acg gag 
His Val Val Leu Arg Asn Tyr Glu Gly Val He Thr Thr Tyr Thr Glu 
355 360 365 

ccg gag aat tat cat gag gag tgc gat tgt gag gac tgt cga ejee ggt 
Pro Glu Asn Tyr His Glu Glu Cys Asp Cys Glu Asp Cys Arg Ala Gly 
370 375 380 

aag cat aaa gag ggt gta get gca ctt tec gga ggt cag cag ttg get 
Lys His Lys Glu Gly Val Ala Ala Leu Ser Gly Gly Gin Gin Leu Ala 
385 390 395 400 

ate gag cct tec gac tta get cgc aaa aaa cgc aag ttt gat aag aac 
He Glu Pro Ser Asp Leu Ala Arg Lys Lys Arg Lys Phe Asp Lys Asn 
405 410 415 



<210> 5 5 

<211> 416 

<212> PUT 

<213> Artificial Sequence 
<220> 

<223> Srynthetic Construct 

<400> 5 5 

Met Ala Glu Ser Arg Arg Lys Tyr Tyr Phe Pro Asp Val Thr Asp Glu 



10 



15 



Gin Trp Tyr Asp Trp His Trp Gin Val Leu Asn Arg He Lys Thr Leu 
20 25 30 



Asp Gin Leu Lys Lys Tyr Val Thr Leu Thr Ala Glu Glu Glu Olu Gly _ 
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35 40 45 

Val Lys Glu Ser Pro Lys Val Leu Arg Met Ala He Thr Pro Tyr Tyr 
50 55 60 

Leu Ser Leu lie Asp Pro Glu Asn Pro Asn Cys Pro He Arg Lys Gin 



Ala He Pro Thr Gin Gin Glu Leu Val Arg Ala Pro Glu Asp Gin Val 
85 90 95 



Asp Pro Leu Ser Glu Asp Glu Asp Ser Pro Val Pro Gly Leu Thr His 
1O0 105 110 



Arg Tyr Pro Asp Arg Val Leu Phe Leu He Thr Asp Lys Cys Ser Met 
115 120 125 



Tyr Cys Arg His Cys Thr Arg Arg Arg Phe Ala Gly Gin Lys Asp Ala 
130 135 140 



Ser Ser Pro Ser Glu Arg He Asp Arg Cys He Asp Tyr He Ala Asn 
145 150 155 160 



Thr Pro Thr Val Arg Asp Val Leu Leu Ser Gly Gly Asp Ala Leu Leu 
165 170 175 



Val Ser Asp Glu Arg Leu Glu Tyr He Leu Lys Arg Leu Arg Glu Val 
ISO 185 190 



Pro His Val Glu He Val Arg He Gly Ser Arg Thr Pro Val Val Leu 
195 200 205 



Pro Gin Arg lie Thr Pro Gin Leu Val Asp Met Leu Lys Lys Tyr His 
210 215 220 



Pro Val Trp Leu Asn Thr His Phe Asn His Pro Asn Glu Val Thr Glu 
225 230 235 240 



Glu Ala Val Glu Ala Cys Glu Arg Met Ala Asn Ala Gly He Pro Leu 
245 250 255 



Gly Asn Gin Thr Val Leu Leu Arg Gly He Asn Asp Cys Thr His Val 
260 265 ~ 270 
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Met Lys Arg Leu Val His Leu Leu Val Lys Met Arg Val Arg Pro Tyr 
275 280 285 



Tyr lie Tyr Val Cys Asp Leu Ser Leu Gly lie Gly His Phe Arg Thr 
290 295 300 



Pro Val Ser Lys Gly lie Glu lie lie Glu Asn Leu Arg- Gly His Thr 
305 310 315 320 



Ser Gly Tyr Ala Val Pro Thr Phe Val Val Gly Ala Pro Gly Gly Gly 
325 330 335 



Gly Lys lie Pro Val Thr Pro Asn Tyr Val Val Ser Gin Ser Pro Arg 
340 345 350 



His Val Val Leu Arg Asn Tyr Glu Gly Val lie Thr Thr Tyr Thr Glu 
355 360 365 



Pro Glu Asn Tyr His Glu Glu Cys Asp Cys Glu Asp Cys Arg Ala Gly 
370 375 380 



Lys His Lys Glu Gly Val Ala Ala Leu Ser Gly Gly Gin Gin Leu Ala 
385 390 395 400 



lie Glu Pro Ser Asp Leu Ala Arg Lys Lys Arg Lys Phe Asp Lys Asn 
405 410 415 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic Construct 



<220> 

<221> CDS 

<222> (1)..(1278) 



<400> 56 

atg aat aca gtt aat act cgt aaa aaa ttt ttc cca aat gta act gat 
Met Asn Thr Val Asn Thr Arg Lys Lys Phe Phe Pro Asn Val Thr Asp 



gaa gaa tgg aat gat tgg aca tgg caa gta aaa aac cgc ctt aaa agt 
Glu Glu Trp Asn Asp Trp Thr Trp Gin Val Lys Asn Arg Leu Lys Ser 
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gtt gaa gat tta gaa aaa tat gtt gat tta agt gaa gaa gaa aca gaa 144 
Val Glu Asp Leu Glu Lys Tyr Val Asp Leu Ser Glu Glu Glu Thr Glu 
35 40 4 5 

ggg gtt gta cgc act ctt gaa act tta cgt atg gca ate act cca ttt 192 
Gly Val Val Arg Thr Leu Glu Thr Leu Arg Met Ala Lie Thr Pro Phe 
50 55 60 

tac ttc tea ttg ata gat ttg aat agt gat cgc tgc cca ata cgt aag 240 
Tyr Phe Ser Leu lie Asp Leu Asn Ser Asp Arg Cys Pro lie Arg Lys 
65 70 75 80 

caa get ata cct act ata cga gaa ata cat caa tct g-at get gat atg 288 
Gin Ala lie Pro Thr lie Arg Glu He His Gin Ser Asp Ala Asp Met 
85 90 95 

ttg gat cct eta cat gaa gat gaa gac tct cca gta cca gga tta act 336 
Leu Asp Pro Leu His Glu Asp Glu Asp Ser Pro Val Pro Gly Leu Thr 
100 105 110 

cat cgc tat cca gat cgt gtt tta ctt eta ata aca gjac atg tgt tct 384 
His Arg Tyr Pro Asp Arg Val Leu Leu Leu He Thr Asp Met Cys Ser 
115 120 125 

gta tac tgt cgc cac tgc act cgt cgc aga ttt get cjgg tea agt gat 432 
Val Tyr Cys Arg His Cys Thr Arg Arg Arg Phe Ala Gly Ser Ser Asp 
130 135 140 

ggt get atg cct atg gat aga att gac aaa gca ata gaa tat att gca 480 
Gly Ala Met Pro Met Asp Arg He Asp Lys Ala He Glu Tyr He Ala 
145 150 155 160 

aaa act cca caa gta agg gat gta ttg tta tea gga gga gat gca ctt 528 
Lys Thr Pro Gin Val Arg Asp Val Leu Leu Ser Gly Gly Asp Ala Leu 
165 170 175 

eta gtt tct aat aaa aaa tta gaa age ata ate caa aaa eta cgc gca 576 
Leu Val Ser Asn Lys Lys Leu Glu Ser He He Gin Lys Leu Arg Ala 
180 185 190 

ata cct cat gtt gaa ata ate aga ata gga agt cgt aca cca gtt gtt 624 
He Pro His Val Glu He He Arg He Gly Ser Arg Thr Pro Val Val 
195 200 205 

tta cct caa aga att act cct gaa tta tgt aat atg tta aag aaa tat 672 
Leu Pro Gin Arg He Thr Pro Glu Leu Cys Asn Met Leu Lys Lys Tyr 
210 215 220 

cat cca att tgg atg aat act cat ttt aac cac cct caa gaa gta acg 720 
His Pro He Trp Met Asn Thr His Phe Asn His Pro Gin Glu Val Thr 
225 230 235 240 

cca gaa get aaa aaa get tgt gaa atg ttg gca gat gca gga gtt cca 768 
Pro Glu Ala Lys Lys Ala Cys Glu Met Leu Ala Asp Ala Gly Val Pro 
245 250 255 

tta gga aat caa act gta eta tta aga gga ata aat gac agt gta cct 816 
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Leu Gly Asn Gin Thr Val Leu Leu Arg Gly lie Asn Asp Ser Val Pro 
260 265 270 

gta atg aaa agg tta gta cat gat tta gta atg atg cgt gta cgc oct 864 
Val Met Lys Arg Leu Val His Asp Leu Val Met Met Arg Val Arg Pro 
275 280 285 

tat tat att tac caa tgt gac tta tct atg gga etc gaa cac ttc cgc 912 
Tyr Tyr lie Tyr Gin Cys Asp Leu Ser Met Gly Leu Glu His Phe Arg 
290 295 300 

aca cca gtt tct aaa ggt ata gaa att att gaa gga tta cgt gga cat 960 
Thr Pro Val Ser Lys Gly lie Glu lie lie Glu Gly Leu Arg Gly His 
305 310 315 320 

aca tct gga tat gca gta cca aca ttt gtt gtg cat gca cct ggt ggt 1008 
Thr Ser Gly Tyr Ala Val Pro Thr Phe Val Val His Ala Pro Gly Gly 
325 330 335 

gga gga aaa act cca gta atg cct caa tat gta att tct caa tct cct 1056 
Gly Gly Lys Thr Pro Val Met Pro Gin Tyr Val lie Ser Gin Ser Pro 
340 345 350 

cat cgt gta gtt tta cgc aac ttt gaa gga gtt ata aca act tat aca 1104 
His Arg Val Val Leu Arg Asn Phe Glu Gly Val He Thr Thr Tyr Thr 
355 360 365 

gaa cca gaa aat tat aca cat gaa cct tgt tat gat gaa gaa aaa ttt 1152 
Glu Pro Glu Asn Tyr Thr His Glu Pro Cys Tyr Asp Glu Glu Lys Phe 
370 375 380 

gaa aaa atg tat gaa ata agt gga gtt tat atg eta gat gaa gga tta 1200 
Glu Lys Met Tyr Glu He Ser Gly Val Tyr Met Leu Asp Glu Gly Leu 
385 390 395 400 

gaa atg tea eta gaa cct age cac tta gca cgt cat gaa cgc aat aaa 1248 
Glu Met Ser Leu Glu Pro Ser His Leu Ala Arg His Glu Arg Asn Lys 
405 410 415 

aag aga gca gaa get gaa ggg aaa aaa taa 1278 
Lys Arg Ala Glu Ala Glu Gly Lys Lys 
420 425 



<210> 57 
<211> 425 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 57 

Met Asn Thr Val Asn Thr Arg Lys Lys Phe Phe Pro Asn Val Thr Asp 
15 10 15 
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Glu Glu Trp Asn Asp Trp Thr Trp Gin Val Lys Asn Arg Leu Lys Ser 



Val Glu Asp Leu Glu Lys Tyr Val Asp Leu Ser Glu Glu Glu Thr Glu 



Gly Val Val Arg Thr Leu Glu Thr Leu Arg Met Ala He Thr Pro Phe 



Tyr Phe Ser Leu lie Asp Leu Asn Ser Asp Arg Cys Pro He Arg Lys 



Gin Ala He Pro Thr He Arg Glu Xle His Gin Ser Asp Ala Asp Met 



Leu Asp Pro Leu His Glu Asp Glu Asp Ser Pro Val Pro Gly Leu Thr 
100 105 110 



His Arg Tyr Pro Asp Arg Val Leu Leu Leu He Thr Asp Met Cys Ser 
115 120 125 



Val Tyr Cys Arg His Cys Thr Arg Aig Arg Phe Ala Gly Ser Ser Asp 
130 135 140 



Gly Ala Met Pro Met Asp Arg He Asp Lys Ala He Glu Tyr He Ala 
145 150 155 160 



Lys Thr Pro Gin Val Arg Asp Val Leu Leu Ser Gly Gly Asp Ala Leu 
165 170 175 



Leu Val Ser Asn Lys Lys Leu Glu Ser He He Gin Lys Leu Arg Ala 
180 X85 190 



He Pro His Val Glu He He Arg Xle Gly Ser Arg Thr Pro Val Val 
195 200 205 



Leu Pro Gin Arg He Thr Pro Glu Leu Cys Asn Met Leu Lys Lys Tyr 
210 215 220 



His Pro He Trp Met Asn Thr His Phe Asn His Pro Gin Glu Val Thr 
225 230 235 240 



Pro Glu Ala Lys Lys Ala Cys Glu Met Leu Ala Asp Ala Gly Val Pro 
245 250 255 
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Leu Gly Asn Gin Thr Val Leu Leu Arg Gly lie Asn Asp Ser Val Pro 
260 265 270 



Val Met Lys Arg Leu Val His Asp Leu Val Met Met Arg Val Arg Pro 
275 280 285 



Tyr Tyr He Tyr Gin Cys Asp Leu Ser Met Gly Leu Glu His Phe Arg 
290 295 300 



Thr Pro Val Ser Lys Gly He Glu He He Glu Gly Leu Arg Gly His 
305 310 315 " 320 



Thr Ser Gly Tyr Ala Val Pro Thr Phe Val Val His Ala Pro Gly Gly 
325 330 335 



Gly Gly Lys Thr Pro Val Met Pro Gin Tyr Val He Ser Gin Ser Pro 
340 345 350 



His Arg Val Val Leu Arg Asn Phe Glu Gly Val He Thr Thr Tyr Thr 
355 360 365 



Glu Pro Glu Asn Tyr Thr His Glu Pro Cys Tyr Asp Glu Glu Lys Phe 
370 375 380 



Glu Lys Met Tyr Glu He Ser Gly Val Tyr Met Leu Asp Glu Gly Leu 
385 390 395 400 



Glu Met Ser Leu Glu Pro Ser His Leu Ala Arg His Glu Arg Asn Lys 
405 410 415 



Lys Arg Ala Glu Ala Glu Gly Lys Lys 
420 425 



<210> 58 

<211> 1416 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic Construct 



<220> 

<221> CDS 

<222> tl)..(1416) 
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<400> 58 

atg aaa aac aaa tgg tat aaa ccg aaa egg cat tgg aag gag ate gag 
Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 



tta tgg aag gac gtt ccg gaa gag aaa tgg aac gat tgg ctt tgg cag 
Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

ctg aca cac act gta aga acg tta gat gat tta aag aaa gtc att aat 
Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 



ctg acc gag gat gaa gag gaa ggc gtc cgt att tct acc aaa acg ate 
Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 



ccc tta aat att aca cct tac tat get tct tta atg gac ccc gac aat 
Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 



ccg aga tgc ccg gta cgc atg cag tct gtg ccg ctt tct gaa gaa atg 
Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



cac aaa aca aaa tac gat atg gaa gac ccg ctt cat gag gat gaa gat 
His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 

tea ccg gta ccc ggt ctg aca cac cgc tat ccc gac cgt gtg ctg ttt 
Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 

ctt gtc acg aat caa tgt tec gtg tac tgc cgc cac tgc aca cgc egg 
Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 

cgc ttt tec gga caa ate gga atg ggc gtc ccc aaa aaa cag ctt gat 
Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 " 150 155 160 

get gca att get tat ate egg gaa aca ccc gaa ate cgc gat tgt tta 
Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 

att tea ggc ggt gat ggg ctg etc ate aac gac caa att tta gaa tat 
He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 

att tta aaa gag ctg cgc age att ccg cat ctg gaa gtc ate cgc ate 
He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 

gga aca egt get ccc gtc gtc ttt ccg cag cgc att acc gat cat ctg 
Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 
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tgc gag ata ttg aaa aaa tat cat ccg gtc tgg ctg aac acc cat ttt 72 0 

Cys Glu lie Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 

aac aca age ate gaa atg aca gaa gaa tec gtt gag gca tgt gaa aag 768 
Asn Thr Ser lie Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 

ctg gtg aac gcg gga gtg ccg gtc gga aat cag get gtc gta tta gca 816 
Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 

ggt att aat gat teg gtt cca att atg aaa aag etc atg cat gac ttg 864 
Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 

gta aaa ate aga gtc cgt cct tat tat att tac caa tgt gat ctg tea 912 
Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 

gaa gga ata ggg cat ttc cgt get cct gtt tec aaa ggt ttg gag ate 960 
Glu Gly He Gly His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 

att gaa ggg ctg aga ggt cat acc tea ggc tat gcg gtt cct acc ttt 1008 
He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 

gtc gtt cac gca cca ggc gga gga ggt aaa ate gec ctg cag ccg aac 1056 
Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 

tat gtc ctg tea caa agt cct gac aaa gtg ate tta aga aat ttt gaa 1104 
Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 

ggt gtg att acg tea tat ccg gaa cca gag aat tat ate ccc aat cag 1152 
Gly Val He Thr Ser Tyx Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 

gca gac gec tat ttt gag tec gtt ttc cct gaa acc get gac aaa aag 12 00 

Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 

gag ccg ate ggg ctg agt gec att ttt get gac aaa gaa gtt teg ttt 1248 
Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Phe 
405 410 415 

aca cct gaa aat gta gac aga ate aaa egg cgt gag gca tac ate gca 1296 
Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 

aat ccg gag cat gaa aca tta aaa gat egg cgt gag aaa aga gat cag 1344 
Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Asp Gin 
435 440 445 



etc aaa gaa aag aaa ttt ttg gcg cag cag aaa aaa cag aaa gag act 
Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
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gaa tgc gga ggg gat tct tea taa 
Glu Cys Gly Gly Ase> Ser Ser 



<210> 59 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 



<400> 59 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
! 5 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr He 
50 55 60 

Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 

Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 HO 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 

Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 

Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
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lie Ser Gly Gly Asp Gly Leu Leu lie Asn Asp Gin lie Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Gly His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 
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Glu Pro lie Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Phe 
405 410 415 

Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Asp Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin. Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 60 
<211> 471 
<212> PRT 

<213> lysine 2 , 3-aminomutase from Bacillus subtilis 
<400> 60 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg- His Trp Lys Glu He Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp. Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp> Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg- He Ser Thr Lys Thr He 
50 55 60 

Pro Leu Asn He Thr Pro Tyr Tyr Ala Sezr Leu Met Asp Pro Asp Asn 
65 7 ° 75 ' 80 

Pro Arg Cys Pro Val Arg Met Gin Ser Val. Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Leu Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 no 

Ser Arg Val Pro Gly Leu Thr His Arg Tyzr Pro Asp Arg Val Leu Phe 
H5 120 125 
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Leu Val Thr Asn Gin Cys Ser Met Tyr Cys Arg Tyr Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro H±s Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 " 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Gly His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val Asp Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 
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Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 

Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 

Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 • 330 395 400 

Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu. Val Ser Phe 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala. Tyr He Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Arg- Arg Asp Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 61 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 
<400> 61 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trrp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thar Lys Thr He 
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Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 



His Lys Thr Lys Tyr Asp Leu Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Met Tyr Cys Arg Tyr Cys Thr Arg Arg 
130 - 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys lys Gin Leu Asp 
145 150 * 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg lie 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 
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Glu Gly He Gly His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 ' 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val Asp Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Phe 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Asp Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 62 

<211> 471 

<212> PRT 

<213> Artifical Sequence 



<400> 62 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
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Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 



Leu Thr Glu Asp Glu Glu Glu Gly Val Axg He Ser Thr Lys Thr He 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Paro Asp Asn 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser GZLu Glu Met 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 

100 105 lno 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Tiir Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys G3.n Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin lie Leu Glu Tyr 
180 185 ISO 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 
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Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Lesu Ala 
260 265 270 



Gly lie Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 ~ " 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Trar Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Ptie Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 " 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 63 
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-106- 



<211> 49 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Bacillus specific primer 



<220> 

<221> misc_feature 
<223> Forward primer 

<400> 63 

ccagcctggc cataaggaga tatacatatg aaaaacaaat ggtataaac 49 



<210> 64 

<211> 50 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Bacillus specific primer 



<220> 

<221> misc_feature 
<223> Reverse primer 



<400> 64 

atggtgatgg tgatggtggc cagtttggcc ttatgaagaa tcccctccgc 



50 



